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ABSTRACT 

Using the wide multi-band photometry available in the COSMOS field we explore the host 
galaxy properties of a large sample of Active Galactic Nuclei (AGN) (~1700 objects) with Lboi 
ranging from 10 43 to 10 47 erg s obtained by combining X-ray and optical spectroscopic se- 
lections. Based on a careful study of their Spectral Energy Distribution (SED), which has been 
parametrized using a 2-component (AGN+galaxy) model fit, we derived dust-corrected rest- 
frame magnitudes, colors and stellar masses of the obscured and unobscured AGN hosts up 
to high redshift (z<3). Moreover, for the obscured AGN sample we also derived reliable star 
formation rates (SFRs). We find that AGN hosts span a large range of stellar masses and SFRs. 
No color-bimodality is seen at any redshift in the AGN hosts, which are found to be mainly 
massive, red galaxies. Once accounting for the color-mass degeneracy in well defined mass- 
matched samples, we find a residual (marginal) enhancement of AGN incidence in redder 
galaxies with lower specific star formation rates, and we argue that this result might emerge 
because of our ability to properly account for AGN light contamination and dust extinction, 
compared to surveys with a more limited multi-wavelength coverage. However, since these 
color shifts are relatively small, systematic effects could still be held responsible for some of 
the observed trend. Interestingly, we find that the probability for a galaxy to host a black hole 
growing at any given "specific accretion rate" (i.e. the ratio of X-ray luminosity to the host 
stellar mass) is almost independent of the host galaxy mass, while it decreases as a power-law 
with Lx/M*. By analyzing the normalization of such probability distribution, we show how 
the incidence of AGN increases with redshift as rapidly as ( 1 +z) 4 , in close resemblance with 
the overall evolution of the specific star formation rate of the entire galaxy population. We 
do provide the analytic fitting formulae that describe the probability of a galaxy of any mass 
(above the completeness limit of the COSMOS survey) to host an AGN of any given specific 
accretion rate as a function of redshift. These can be useful tools for theoretical studies of the 
growing black holes population within galaxy evolution models. Although AGN activity and 
star formation in galaxies do appear to have a common triggering mechanism, at least in a 
statistical sense, within the COSMOS sample we do not find strong evidence of any 'smoking 
gun' signaling powerful AGN influence on the star-forming properties of their hosts galaxies. 

Key words: Surveys, Catalogues, Galaxies:active, Galaxies: fundamental parameters, Galax- 
ies: evolution 
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1 INTRODUCTION 

Most of the times since their discovery, more than 40 years ago, 
Active Galactic Nuclei (AGN) powered by accretion onto Super- 
massive Black Holes (SMBH) have been considered rare and exotic 
objects with peculiar (and extreme) astrophysical properties. In the 
past decade, studies in the local Universe established the presence 
of supermassive Black Holes in the nuclei of virtually all galaxies 
with a bulge/spheroidal compone nt (e.g. iKormendv & Richstona 
ll995l : lKormendv& Bendeifeplll) ; changing dramatically the per- 
ception of this class of objects. Since cosmological SMBH growth 
is mostly due to accretion of matter during their active phases 
dSoltarJ fl982h . we are led to conclude th at most bulges went 
through a phase of strong nuclear a ctivity ^Marconi et al. I [20041 ; 
iMerloni & Heinz 2008; Sh ankaj2009l) . Moreover, as the energy re- 
leased in the process of accretion on the SMBH can be higher than 
the total binding energy of a massive galaxy, AGN may represent 
a key ingredient of the formation and evolution of all galaxies. If, 
and how, the nuclear black holes influence their host galaxies, and 
vice- versa, became a major focus in studies of structure formation 
and evolution. 

Indeed, many observational discoveries do support the no- 
tion of a close link between SMBH growth and the assem- 
bly of the host galaxy. The most crucial ones are the fol- 
lowing: (i) The tight correlation between the central SMBH 
mass a nd various properties of thei r host galaxies i.e. lumi- 
nosity: dKormendv & Richstonel 1 19951 ; iMcLure & Dunlod 1200 ll ; 



iMarconi & Huntl 120031), stellar mass jMagorrian et all 19981) 



and velocity dispersion jFerrarese & Merritlll200(]|: iGebhardt et"al] 



l2000l : lTremaine etai]|2002l : IKormendv & Bended l201lh . (ii) As 
populations, SMBHs and galaxies seem to evolve with redshift in 
a similar way. In fact, the AGN evolution is luminosity-dependent 
with low-luminosity AGN reaching the peak in their space den- 
sity later in the history of t he Universe (i.e. a t lower redshift) 
than h igher luminosity AGN dMivaii et alj 
20051: lUeda et alj |2003|; iFiore et al.l I2603F 



20001: lLa Franca et alj 



Cirasuolo et al 



20051 ; 



Hasinger et alj |2005; Bong iorno et alj|2007l) . This pattern is sim 
ilar to the so-called "cosmic downsizing" of star forming galax 



ICS dCowieet all 19961: iMenci et al ]|2008l) se en als o in the spheroid 
galaxies dCimatti et alj|2006t iThomas et al ]|20h3). (iii) The shape 
of the integrated AGN activity over cosmic time is similar to the 
one of the global star formation rat e (SFR) with a peak at z~2 and 
a rap id decline at both low er (e.g. iDickinson et al. l l2003l ; lMerlonil 



20041; lHopkinset al.ll2007l;IShankaret alj 120091) and higher (e.g . 
Franceschini et alj 1 19991 ; IWilkins et al.ll2008l : iBrusa et alj|2009al) 
redshift. (iv) Finally, luminous AGN show a correlation between 
the AGN bolometric luminosity and the star formation (SF) of the 
host galaxy o ver more than five orders of magnitude in luminosity 
jNetzedl200^) , albeit with a substantial scatter; possible deviation 
from such a behavior have however been reported for lower lumi- 
nosity AGN JShao et al.|[2oTol ; iMullanev et alj|2012l : Is antini et alj 
l2012h . 

While the evidences of a tight link between the formation of 
galaxies and the growth of the SMBHs at their centers are many 
and clear, the physical processes behind this interplay remain un- 
clear. In particular, the fundamental question of whether AGN- 
driven feedback processes are ultimately responsible for determin- 
ing global properties of the galaxy population (e.g. luminosity func- 
tions, color-magnitude distributions and the evolution thereof) or, 
on the contrary, they happen to be triggered and fueled as a by- 
product of star-formation activity and morphological evolution of 
their hosts still remains unanswered. This is of fundamental im- 



portance for understanding both galaxy assembly and the accretion 
density of the Universe. The work we present here is indeed mo- 
tivated by the desire to obtain robust, and statistically significant, 
constraints on the possible scenarios for the co-evolution of galax- 
ies and AGN. 



A slew of models (e. 


g. ISomerville et alj|200ll; iGranato et al. 


2004 


Monaco & Fontanot 


2005l:ICroton et all2006l;lHoDkins et al. 


2006 


Cen&Chisari 2011 


; ISpringel et alj|2005l: Schawinski et al. 


2006 


have been developed in recent years to explain this co- 



evolution and to describe the main mechanism that fuel the central 
SMBH and build the galaxies' bulge. Some of these semi-analytical 
models and hydrodynamical simulations (e.g. ISpringel et aT]|2005l : 
iHopkins et al ] 120061 ; IMenci et alj l2008h invoke major mergers 
of gas-rich galaxies as the main fueling mechanism. Alterna- 
tive mechanisms have been discussed in the literature, includ- 
ing mmor-mergers (e.g . Ilohansson et alj[2009h . disk instabilities 
(e.g. iGenzel et alj|2008l) . and recycled gas from dying stars (e.g. 
ICiotti et al]|2010l) . Different fueling mechanisms are usually asso- 
ciated to different luminosity r anges e.g. major mer gers are invoked 
to trigger bright quasars (e.g. IHopkins et alj|2003) . while the role 
of major mergers in t riggering more typical (~ L.) AGN is called 
into question (see e.g . lCisternas et alj201ll ; [Schawinski et al .120111: 
iKocevski et al.ll2012f) . On the contrary, secular processes seem to 
be enough to fuel low luminosity AGN i.e. Seyfert-like objects 
{ IHopkins et al]|2008l ; lAllevato et alj|201 lh . Due to the large mis- 
match in physical scales between AGN and galaxies, all these mod- 
els have to rely on specific assumptions regarding the mechanism 
responsible for the link between the nuclear activity, which releases 
most of its energy on the scale of few Schwarzschild radii Rs (~ 
10~ 5 pc), and the stellar population on much larger scales (~ a few 
kpc, some ~ 10 8 times Rs). This mechanism is in most cases an 
energetic feedback from the central engine that deposits the en- 
ergy liberated by th e accretion proces s within the host galaxy, or 
its dark matter halo (Silk & Reesl 19981) The physical description of 
the feedback itself and its effect on the AGN host galaxy is, how- 
ever, very different among different models. 

In particular, due to the complex interplay between gas in- 
flow, outflow and nuclear obscuration during active AGN phases, 
a correct and complete identification of unobscured, obscured, and 
highly obscured AGN at all redshifts (and especially in the z = 1 - 3 
interval, where most of the feedback is expected to happen) is there- 
fore crucial for a comprehensive understanding of the still little 
explored phase of the common growth of SMBHs and their host 
galaxies. 

Generally speaking, two different approaches can be followed 
to test the predictions of different theoretical models of AGN feed- 
back and to constrain the physical mechanisms at play: a detailed 
study of the physical properties of relatively small but well de- 
fined samples of selected sources or a statistical investigation of a 
large sample of AGN which includes different sub-classes. Within 
the COSMOS survey, t he first approach has been followed in 
our previous papers (i.e. Merloni et alj|20 id : iMainieri et aljfeoi ll : 
ICisternas et ailuoTll ; iLusso etld] 201 lh . while the latter approach 
is adopted here. 

To study growing SMBHs and their influence on the host 
galaxies over different periods of their cosmic evolution, one needs 
to compile a sample of AGN (both unobscured and obscured) as 
complete as possible, spanning a wide range in luminosity and 
redshift. X-ray surveys have been proved to be the most efficient 
way to compile nearly un biased samples of Compton Thin AGN 
jBrandt & Hasingej2005l). but they may miss Comp ton thick AGN 
(e.g. IComastri & Brusall2008l : lLa Massa et alj|2009l) . On the other 
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hand, the optical spectroscopic selection, even though allowing a 
straightforward selection of unobscured AGN and being less bi- 
ased against Compton Thick AGN, can fail to select those AGN for 
which the nuclear optical/IR light is diluted by their host galaxy. To 
overcome these limitations, we decided to consider and combine 
X-ray and optically selected AGN samples compiling a highly ho- 
mogeneous and representative sample of obscured and unobscured 
AGN selected in the COSMOS field over a wide redshift baseline 
(0<z<4). 

Other select i ons criteria (e.g. fr om IR color-color diagrams, 
Stern et alj|2005l: iDonlev et alj|2012l . or from radio surveys, e.g. 



Smolcic et alj|2008) may provide additional AGN and are needed 



in order to get a fu ll picture on the AGN-galaxy co-evolution (e.g. 
Iffickox etal.1 120091) . However, since X-ray and optical selections 
make up more than 50% of the full AGN population in COSMOS 
(Brusa et al. in prep) for the purpose of this paper we will focus on 
only these two methods quantifying, whenever relevant, the incom- 
pleteness due to the not inclusion of the other samples. 

The major observational challenge in any comprehensive 
study of the AGN-galaxy co-evolution is the accurate separation of 
the AGN and galaxy emission components, at all optical-IR wave- 
lengths. This is a crucial step for a number of reasons: depending 
on the intrinsic spectral energy distribution of the nuclear (AGN) 
and of the stellar light, and of their respective level of extinction, 
inaccurate de-blending might not only hamper any precise deter- 
mination of the galaxies' physical properties, but also mask AGN 
signatures and bias our view of the SMBH growth. 

To this end, we used a two-component Spectral Energy Distri- 
bution (SED) fitting procedure. The observed SED is fitted with a 
large g rid of models based on a combination of the lRichards et alj 
(2006) AGN template and several host-galaxy models (i.e. syn- 
thetic srjectra_£reated_^rarijhe stellar population synthesis mod- 
els o f iBruzual & Charlotll2003l) . Given the wide multi-wavelength 
coverage available in the COSMOS field, this fitting technique al- 
lows to decompose, typically with a high level of confidence, the 
entire SED into nuclear AGN and host galaxy components and to 
derive robust measurements of the host galaxy physical properties, 
e.g. rest-frame colors, stellar mass, K-band luminosity and star for- 
mation rate. The strength of the SED fitting method is that, given 
sufficiently wide photometric coverage, it is applicable to all AGN, 
obscured and unobscured, independent of their luminosity. In par- 
ticular, once a comprehensive set of templates for the SED com- 
ponents is chosen, the method can be applied (almost) blindly to 
any detected object in a multi-wavelength survey, irrespective of 
the specific selection criteria, reaching an accuracy that depends 
crucially on the number of bands and depth of the available photo- 
metric catalogs. In this respect, COSMOS is a uniquely suited field 
for our investigation. 

The structure of the paper is as follows: In Section [2] we will 
introduce our sample, before proceeding to a discussion of the 2- 
component SED fitting method in Section [3] In Section [4] we will 
discuss the physical properties of the AGN host galaxies (i.e. col- 
ors, masses and SFRs) compared to normal galaxies focusing in 
Section |4~3l and |4~4l on the incidence of the AGN as a function of 
their host properties i.e. mass and star-formation rate, respectively. 
A final discussion and a summary of our main results are presented 
in Section[5]and[6] Additionally, in Appendix|A|we discuss the ef- 
fect on the derived host galaxy properties if the AGN component is 
not properly subtracted, while Appendix [B] shows a detailed com- 
parison between two SFR indicators i.e. UV-optical SED fitting and 
the FIR bands based method. In Appendix [C] two tables with the 
AGN catalog and the parameters derived from the SED fitting (e.g. 



rest-frame magnitudes, masses and SFRs) can be found together 
with a short explanation of the columns. These tables show only 
16 lines as an example of the ones published online. Throughout 
this paper, we use the standard cosmology (Q_ m =0.3, Ci A =0.7 ', with 
Ho=70 km s -1 Mpc~'). Magnitudes, if not differently specified, are 
AB magnitudes. 



2 AGN SAMPLES FROM THE COSMOS SURVEY 

In order to construct a sample of AGN as complete as possi- 
ble, including both obscured and unobscure d objects, we first se- 
lected XMM-COSMO S point-like sources l lHasinger et~al1l2007l : 
ICappellutie7aI]|2009h , to which we added a smaller sample of 
purely opticall y selected AGN from the zCOSMOS bright spectro- 
scopic suryeyjjLilly et al. 2009), spanning a lower AGN luminosity 
range dBongiorno et al.l2010l Mignoli et al. in prep). In this section, 
we give the details of our sample selection. 



2.1 The X-ray selected sample 

The XMM-COSMOS catalog includes -1800 point-like X-ray 
source s. The X-ray catalog has been presented in lCappelluti et al] 
d2009h , while the optical ident ifications and mu lti-wavelength 
properties have been discussed in iBrusa et alJfeOldR The sources 



of the XMM-COSMOS catalog of lBrusa et all tepid) have been 
classified as normal galaxies, obscured and unobscured AGN ac- 
cording to their properti es, i.e. X-ray lumin osities, spectra and 
multi-wavelength SEDs dSalvato et alj 1201 ll) : here we consider 
only those sources classified as AGN, both unobscured and ob- 
scured, as we describe below. 

Optical spectra are available for more than half of the sam- 
ple from d i fferen t instru ments: VIMOS/VLT ( zCOSMOS project 
Lilly et alj |2007|. 120091), IMACS/M agellan jTrump et al] l200l 
20091) , MMT JPrescott et"ai1 12006|) , and DEIMOS/Keckll (P.I. 
Scoville, Capak, Salvato, Kartaltepe, Mobasher). 

From the XMM-COSMOS subsample for which spectra 
are available, we identified 430 sources which show broad 
(FWHM>2000 km s~') emission lines in their optical spectra, 
which we classify as type-1 (unobscured). Further 402 spectro- 
scopically confirmed AGN were instead classified as type-2 (ob- 
scured), by making use of both their spectral properties and X-ray 
luminosities. In particular, the type-2 AGN class includes: (a) all 
the sources that do not show broad emission lines and for which the 
ratio between their high-io nization narrow lines indicates AGN ac- 
tivity IBaldwin et al.lll98ll) and (b) in case the high ionization lines 
are not detected in the observed wavelength range, all the objects 
without broad lines and with rest-frame (2-10 keV) X-ray luminos- 
ity greater than 2xl0 42 erg s -1 , 

For about half of the XMM-COSMOS sources (723 objects), 
only photometric redshifts are available (the "photo-z" AGN sam- 
ple), which are however very ac curate with (Ta,-/< i+.-) ~0.01 5 and a 
fraction of outliers of just 5.8% dSalvato et al1l2009l . l201 ll) . Thus, 
the XMM-COSMOS sources for which no spectra are available 



1 In this paper we use an improved version of the XMM-COSMOS 
identifi cations catalog with respect to the one published in IBrusa et al] 
the main dif ference being the a vailability of new photo- 
metric redshifts from ISalvato et alj 4201 lh . The updated version of 
the XMM-COSMOS identification catalog can be downloaded here: 
http://www.mpe.mpg.de/XMMCosmos/xmm53_release/ 
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Obscured AGN sample Unobscured AGN sample 




Figure 1. Schematic view of the analyzed sub-samples for obscured (left) and unobscured (right) AGN. Different circles represent the different methods used 
to select them. Blue and red circles correspond to X-ray unobscured and obscured AGN, green and yellow to optically selected AGN using the diagnostic 
diagrams and the [Ne v] line. Open and filled symbols are for photometric and spectroscopic redshift whose origin is highlighted with the slices of different 
color gradations. The combined analyzed sample consists of 1702 AGN: 602 type-1 AGN (of which 430 with spectra) and 1 100 type-2 AGN (of which 549 
with spectra). 
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Table 1. Analyzed sub-samples of obscured and unobscured AGN. Bold 
numbers indicate the total number of sources while the rest highlight the 
sources in common between two or more classification methods. The com- 
bined analyzed sample consists of 1702 AGN: 602 unobscured AGN (of 
which 430 with spectra) and 1 100 obscured AGN (of which 549 with spec- 
tra). 



have been classified as unobscured type-1 in 172 cases, and as ob- 
scured type-2 in 551 cases. In practice , the source cl a ssifica tion for 
"photo-z" AGN was performed as in ISalvato et al.l d201ll) on the 
basis of the templatq 2 ] that best describes their SED, and applying 
the same threshold in X-ray luminosity (ip-io]keV > 2 x 10 42 erg 
s" 1 ). 

Summarizing, the XMM-COSMOS sample analyzed in this 
work comprises 602 unobscured type-1 AGN (430 with spectro- 
scopic redshift and broad emission lines; 172 with photometric red- 
shift) and 953 obscured type-2 AGN (402 with spectroscopic red- 
shift and 551 with photometric redshift), for a total of 1555 X-ray 
selected AGN. 



2 chosen from a library of hybrid template obta ined combining the ob- 
served galaxy templ ates from Pollett aet alj fe007l) with different levels of 

J 



AGN contamination ISalvato et alj201 



2.2 Optically selected Seyfert-2 galaxies: diagnostic 
diagrams 



For type-1 AGN the X-ray and optical selection match very well, 
i.e. in addition to the 430 spectroscopic broad line AGN with XMM 
detection, there are only 25 (~6%) objects purely optically selected 
from the zCOSMOS bright survey which are not detected in the X- 
ray band by XMM (the number goes down to just 14, ~3%, if we 
consider also Chandra detections). This is not the case for obscured 
type-2 AGN, for which a more substantial fraction (>25%) can 
be selected from optical surveys showing no X-ray counterparts. 
Moreover, while the redshift and luminosity distribution of X-ray 
and optically selected type-1 AGN is very similar, for type-2 ob- 
jects this is not the case and different bands can allow us to retrieve 
different sub-classes of sources. For this reasons, we include in this 
study also obscured type-2 AGN selected purely on optical prop- 
erties. Since optically selected AGN (Seyfert-2 galaxies) tend to 
have much lower luminosities (Fig. [Tol this allows us to enlarge 
the analyzed luminosity range. 

The sample of optical type-2 AGN was selected in the follow- 
ing way: from the final 20 000 VIMOS/VL T zCOSMOS bright, and 
in analogy with the procedure adopted inlBongiorno et alj j2010h 
for the first 10 000 spectra (lOk. lLillv et alj2009h . we have selected 
392 sources using the standard diagnostic dia grams ([O m]/HB ver- 
sus [Nn]/Ha and [Om]/H/J versus rS nl/Ha JBaldwin et aljfl98ll , 
BPT diagram) to isolate AGN in the redshift range 0.1 5< z <0.45 
and t h e diagnostic diagram [ O m]/H8 versus [Oii]/H/3 dRola et al] 
1 19971 : lLamareille etail |2004 ' 'blue" diagram) to extend the se- 
lection to higher redshift (0.5< z <0.92), These diagnostic dia- 
grams allow the classification of emission line objects as Seyfert-2 
galaxies, candidate Seyfert-2 galaxies, and LINERs. Here we in- 
clude only the 105 objects classified as se cure Seyfert-2 galaxie s 
(the LINER sample is studied separately in lTommasin et alj|2012h . 
Of these 105 AGN, 18 have X-ray counterparts and therefore are 
already included in the X-ray selected sample described above. 
Hence, from the optical selection based on the spectral diagnostic 
diagrams, we isolated 87 pure optically selected Seyfert-2 which 
do not show any X-ray emission at the depth of our XMM observa- 
tions. 
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2.3 Optically selected obscured AGN: the [Ne v] emission line 
sample 

A relatively new method to select obscured AGN in the optical sur- 
veys uses the detection of [Ne v] line whose presence is a distinc- 
tive feature of AGN activity (see e.g. iGilli et alj fJoiOi), Using this 
criterion on the same 20000 zCOSMOS bright spectra, Mignoli 
et al. (in prep) identified 95 obscured type-2 AGN, 1 1 of which 
were already identified through the emission line diagnostic dia- 
grams. Of the remaining 84 sources, 24 have X-ray counterparts 
and are hence already included in the X-ray selected sample de- 
scribed above. From the [Nev]-based optical selection, we thus 
isolated additional 60 pure optically selected Seyfert-2 which do 
not show any X-ray detection. 

In total, the optically selected obscured AGN amount to 189 
sources, of which 147 are purely optically selected with no XMM 
detection^. Since both these two criteria are based on spectral fea- 
tures, the number of spectra available for obscured AGN is artifi- 
cially higher than for the unobscured ones. 



2.4 The total AGN sample and the parent galaxy sample 

The final combined sample analyzed in this paper consists of 1702 
AGN: 602 type-1 AGN (of which 430 with optical spectra) and 
1100 type-2 AGN (of which 549 have spectra). A graphical rep- 
resentation of the sample and its basic substructure, highlighting 
classification, the method used, and overlaps among the various 
sub-samples, is shown in Figure [T] and in table [T] The color code 
adopted in Fig. Q] will be used throughout the entire paper: blue and 
red colors will denote X-ray unobscured and obscured AGN, re- 
spectively; green those Seyfert-2 galaxies selected optically using 
the diagnostic diagrams, and yellow the obscured AGN selected 
via [Ne v] line emission. Open and filled symbols are used instead 
for photometric and spectroscopic redshifts, respectively. The full 
sample can be found in the online material. An example of the pub- 
lished table is given in Table |C 1 1 

Since the aim of this work is to study the AGN host galaxy 
properties and compare them to normal galaxies, throughout this 
paper we will use the IRAC selected galaxy sample in the COS- 
MOS area as comparison "parent" galaxy sample. For the whole 
galaxy sample pho tometric redshifts , masses and star-formation 
rates are available dllbert et aT ||20l3). Moreover, in order to dis- 
entangle multiple dependencies and thus isolate the real differ- 
ences in the properties of galaxies hosting an AGN from those 
that do not, we have also created two matched parent sam- 
ples: a "mass-matched parent sample" and an "i-band-apparent- 
magnitude-matched parent sample". In both cases we matched each 
AGN with 3 non-AGN chosen from the whole IRAC sample. For 
the mass-matched sample we considered the (z, M) plane and for 
each AGN with a given redshift and host galaxy stellar mass (see 
section |3".5.1| below) we extracted the 3 non-AGN objects from the 
parent sample which lie closest in the (z, M) plane to the AGN al- 
lowing as maximum difference a factor of 10 in Mass. The same 
procedure was followed to build the I-band matched sample, this 
time working on the (z, i) plane, allowing as maximum difference a 
factor of 4 in luminosity. 



3 We note here that 7 of the 147 purely optically selected AGN are actually 
detected in the d eeper Chandra ob servation covering the central part of the 
COSMOS field fclvis et aflkoogl Civano et al. in prep). In order to keep 
selection effects under control and exploit the full 2 deg 2 area of COSMOS, 
in this paper we limit the X-ray analysis to the XMM-COSMOS sample. 



2.5 Multi-wavelength observations 

The Cosmic Evolution Survey (COSMOS) is a multi- wavelength 
observational project over 1.4x1.4 deg 2 of equatorial field cen tered 
at (RA, DEQnooo = (150.1083, 2.210) dScoville et afll2007l) . The 
COSMOS field is characterized by an unprecedented deep multi- 
wavelength coverage, and in order to compile the SED for the AGN 
sample described above, we exploited such a large number of multi- 
wavelength data available in the field. For most of the sources we 
have constrained the SED using 14 different bands that encompass 
optical to MIR wavelengths. More specifically, fo r all our sources 
we u sed 6 SUBARU bands (B, V, g, r, i, z; from Taniguchi et alj 
20071). and the CFHT/ U-, J- and K-bands dCapak et all 120071: 



McCracken etaill2010h . All but a few (~97%) of our sources are 



detecte d in all 4 Spitzer/IRAC bands (3.6pm , 4.5/im, 5.8/um and 
8.0«m: ISanders et alj|2007l lllbert et al.il201fj) and -80% of them 
are al so detected in the 24pm Spitzer/MIPS band dLe Floc'h et alj 
120091) . For most of the sources we can thus constrain the SED in 
a very large wavelength interval, ranging from ~3800 A(Ucfht) to 
24pm (Spitzer/MIPS). Moreover, for a few percent of the sample 
we can also rely on longer wavelength detections. In particular, for 
the X-ray sele cted sample, 109 so urces have detection at 70 /jm 
(Spitzer/MIPS. ISanders et alj|2007l) . 216 are visible at 100//m and 
201 at 160/jm (77 of them are detected in both wavelengths) by 
the Herschel Space Observato ry as part of the PACS Evolutionary 
Probe (PElfl; iLutz et alj20lTh guaranteed time key program. 



3 SPECTRAL ENERGY DISTRIBUTIONS OF COSMOS 
AGN 

In this section, we describe the main tool in our analysis, namely 
the study of the overall Spectral Energy Distribution of the entire 
AGN sample. The approach we follow is to consider, with as little a 
priori assumptions as possible, each and every observed SED in the 
sample as the result of the superposition of the nuclear AGN emis- 
sion and of the galactic stellar light entering the fixe d apertures of 
3" dia meter used to derive all optical/IR photometry dSalvato et alj 
|2009|) . 

The typical SED of a pure QSO, shown in the right panel of 
Fig. [2] is characterized by two bumps in the UV and IR r egimes 
dSanders et aklll 9891: IeI vis et al.ll 19941 ; iRichards et a"i]|2006l) which 
create a dip at around \pm. The UV bump is interpreted a s ther- 
mal emission from the accretion disk dCzernv & Elvisll987l) . while 
the IR bump is thought to be due to absorption of intrinsic AGN 
UV, X-ray and optical radiation by dusty clouds in the AGN 'torus' 
on ~pc-s cales, which sub sequently re-radiate this energy at IR fre- 
quencies IBarvaim sll987h . Despite the appa rent large scatter in th e 
observed SED of large QSO/ AGN samples dRichards et aHl2006j) . 
there appear to be little evidence for substantial systematic changes 
in the shape of such intrinsic SED (apart from those introduced by 
dust extinction) across wide luminosity ranges, as recently demon- 
strated by the analysis o f broad Ho- line sele cted local AGN from 
the SDSS (DR7) survey dStern & Laorj|2012l) . 

On the other hand, the optical SED of a galaxy (see e.g. left 
panel of Fig. [2} is usually modeled as due to the integrated light of 
the stellar populations of the galaxy which are generated by differ- 
ent star-formation histories. Galaxy SEDs peak typically at around 
lyum for a very wide range of SFR histories. 

In most AGN (with the exception of the most luminous ones) 



http ://www. mpe.mpg . de/ir/Research/PEP 



6 A. Bongiorno et al. 



t=50My 




100 1000 10 4 
A(A) 



10 f 



S 5 io 45 

CD 



10' 



-1 II II 1 


1 1 1 1 II -I 


i ii 
n \ 


i 1 1 mi i - 




/ J 

I J 
1 1 

I * 
1 I 
1 t 


\ t \ / 


/ ''' / '% 

/ / 
' / 




1 1 




'/ / - 




1 1 

/ ' 
/ J 
/ J 

1 

r 

i 

i i 
j i 
i i 


' 1 
/ 1 
1 I 
1 1 
1 

I 

t 

1 

1 

1 


/ / 

! 1 — 
1 • 

1 1 

< ! — 
1 ' 

' ; 

i - 




i i 
t t 
i i 

0.1' 0.2 f 
i f 
i i 


1 ' 

1 1 
1 1 

0.3' 3! 
I < 
I J 


i 

i 
I 

91 

f 

j 





mil j i 


l l 

li i 


! 

J 1 



100 1000 10 4 10 5 
A(A) 



Figure 2. Left: Examples of galaxy templates derived from lBruzual & C hariot 1 2003) models. The two solid curves correspond to a model with r=0. 1 and with 
ages of 50Myr and 2Gyr. respec tively. The dash ed curves correspond to the latter SED dust-reddened using Equation[3]with E g (B - V)=[0.1, 0.2, 0.3, 0.4, 0.5]. 
Right: AGN template by Richards et al. (2006) (solid line). The dashed curves correspond to the same SED dust-reddened using Equation [4] with different 
E a (B - V) values [0. 1 , 0.2, 0.3, 3, 9]. 



both components significantly contribute and the global SED is the 
result of the combinat i on of the central QS O's and the host ga laxy ' s 
SED dHao et al.l2009l ; |Merloni et alj20ld : lLusso et alj201lb . How 
much these two components contribute to the global SED in detail 
depends on a number of factors, the most important ones being 
their relative luminosity and the level of obscuration affecting each 
of them. 



bands with some degree of contribution from the host galaxy, while 
the optical continuum of type-2 obscured AGN is dominated by the 
host galaxy emission with the AGN component rising mainly at IR 
wavelength. Notice that we are not implying that both components 
are always detectable. Solutions in which the SED is a pure QSO 
(in case of type-1 AGN) or a galaxy with a negligible contribution 
from the central AGN (in case of type-2 AGN) are both possible. 



3.1 Modeling the AGN and host galaxy emission components 
with SED fitting 

Fixing the redshift of the source to the spectroscopic or photomet- 
ric redshift, we fit the observed fluxes /obs with a two-component 
model using a combinat ion of AGN and host-galaxy template s (see 
e.g. similar approaches IPozzi et alj|20l"ol ; iLusso et alj|201lh (but 
see Appendix [A] for a comparison with the standard SED fitting 
method without th e AGN component). Th is fitting technique, ini- 
tially presented in iBongiorno et all d2007t) . has been already suc- 
cessfully used in the analysis of smaller AGN sub-samples i n 
the COSMOS field (i.e. iMerloni etal]|201Cl : lMainieri et alfcoill) . 
More complex analysis, including multiple dust emission compo- 
nents, have been carried out by Lusso et al. (2012). As we men- 
tioned before, the AGN and galaxy SEDs are shaped in a way that 
allows a relatively easy decoupling with the galaxy's emission peak 
falling in the wavelength range of the AGN dip at ~ 1/jm (rest 
frame). To be specific, we assume that: 



fobs - Cl/AGN + CzfG. 



(1) 



Using a library of galaxy and AGN templates we find the best 
normalization parameters C\ and c 2 that reproduce the observed flux 
of each object, by minimizing the^ 2 . In type-1 unobscured AGN, 
we expect the AGN component to dominate in the optical and IR 



3.2 Library templates 

The data are fitted with a large grid of AGN and galaxy tem plates. 
For the AGN component we adopt the lRichards et al .1 (|2006l) mean 
QSO SED as derived from the study of 259 IR selected quasars 
with both Sloan Digital Sky Survey and Spitzer photometry (see 
right panel of Fig. [2]l. The Richards SED is an extension and is 
consistent with the original lETvis et alj Jl994h QSO SED. For the 
galaxy component we generated a library of s ynthetic spectra us- 
ing the models of stellar population synthesis of lBruzual & Charloll 
(2003). We ass umed a universal initial mass function (IMF) from 
IChabrierlj2003l) and we built 10 exponentially declining star forma- 
tion histories (SFH) S FR oc e^'^ clT with e-folding times, t, ranging 
from 0. 1 to 30 Gyr and a model with constant star formation. For 
each of the SFHs, the SED was generated for a grid of 13 ages rang- 
ing from 50 Myr to 9 Gyr, subject only to the constraint that the age 
should be smaller than the age of the Universe at the redshift of the 
source. Some examples of the used galaxy templates are given in 
the left panel of Fig. [2] 



3.2.1 Extinction 

Both the AGN and the galaxy templates can be affected by dust 
extinction. The observed flux can be written as 



AGN in COSMOS: host galaxy properties 7 




Figure 3. Examples of SED decompositions for unobscured AGN (upper panels) and obscured ones (bottom panels). Black circles are rest-frame fluxes 
corresponding to the observed bands used to constrain the SED. Purple and blue lines correspond to the galaxy and the AGN template found as best-fit solution 
through the x~ minimization, while the black line shows their sum. Pink and cyan shaded areas show the range of the SED template library within l<x of the 
best-fit template, and light gray their sum. 



/obsW = MX) X 10-° ^, (2) 

where / obs and /j nt are the observed and the intrinsic fluxes, respec- 
tively, and A,( is the extinction at a given wavelength A described 
by A i = k(A)E(B - V), with the color excess E(B - V) and the 
reddening curve k{A). 



Galaxies: 

Recent studies on high redshift galaxies and star formation ob- 
scured by dust have shown the importance of the reddening in 
the high-z universe. Therefore, dust extinction, produced inside the 
galaxies themselves, is an important effect to be taken into account. 

We chose as reddening curv e for our galaxy templates the 
Calzetti's law dCalzetti et alfcOOOl) . which is the most used attenu- 
ation curve in high-redshift studies. The Calzetti's law is an empir- 
ical relation derived using a small sample of low-redshift starburst 
galaxies (SB) and has the following form: 



above the valid wavelength range, the slope is computed extrap- 
olating to lower and higher wavelengths the value of k g (A) obtained 
by interpolating between 0.11 - 0.12//m and 2.19 - 2.20yum, respec- 
tively. 

In the fitting procedure we considered for the galaxy compo- 
nent only E e (B - V) va l ues in the range < E 9 { B - V ) < 0.5. 
Following iFontana et al.l d2006t) and lPozzetti et al.l d2007t) . we im- 
pose the prior E g (B — V) < 0.15 if ? ag e/r > 4 (i.e. we excluded the 
models implying large dust extinctions in the absence of a signifi- 
cant star formation rate, t flgc /T > 4). 

In the left panel of Fig. [2] we show two examples of galaxy 
templates with t=0. 1 and ? agc =50Myr, 2Gyr (solid lines). For the 
latter template we also show the corresponding dust-reddened tem- 
plates obtained by applying the above equations with different 
E g (B - V) values (dashed lines). 

AGN: 

The extinction of the nuclear AGN light has been mod eled with an 
SMC-like dust-reddening law from lPrevot et alj d 19841) for which: 



2.659 -2.156 + if* - ^ + 2§I + r v 
s | 2.659 (-1.857 + if2)+i?v 

with R v = 4.05. The two equations are valid for 0.12/jm < A ^ 
0.63 fim and 0.63 jim < A < 2.20 /urn, respectively. Below and 



k a (,A) = 1.39 AT^ (4) 

We considered E a (B- V) values in the range < E CI (B-V) < 1 
for unobscured type-1 AGN and 0.3 < E a (B - V) < 9 for the ob- 
scured ones, in steps of AE„ (B - V) = 0. 1 . In the right panel of Fig. 
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Figure 4. Left Panel: Median AGN SED chosen as best fit for our sample of unobscured (blue continuum line) and obscured (red continuum line) AGN, all 
normalized at rest-frame 12 /im. Shaded areas correspond to the range within 25th and 75th percentiles, while long dashed lines that within the 10th and 90th 
percentiles. Right: Median AGN contribution to the total SED at different wavelengths for unobscured (blue continuum line) and obscured (red continuum 
line) AGN. Shaded areas correspond to the range within the 25th and 75th percentiles, while long dashed lines that within the 10th and 90th percentiles. 



[2] we show the AGN template (solid line) and the corresponding 
dust-reddened templates obtained for different E a {B - V) values. 
As visible in this figure, the upper limit of E a {B - V) = 9 corre- 
sponds to a very extinct AGN SED, and an even higher upper limit 
would not change by much the SED shape. 

Taking into account the described extinction laws, we can re- 
write Eq.Q]as 



(5) 



+ cajgLOi) x io-° M ^ iB - V) 

with k g (/l) described by Eq.[3] 
3.3 Results of the SED fitting 

Given the wide multi-wavelength coverage, the fitting technique 
described above allows to decompose the entire spectral energy dis- 
tribution into a nuclear AGN and a host galaxy component and to 
derive robust measurements of both the AGN and the host galaxy 
properties, for almost all (98.6%) of the objects in the original sam- 
plqj. Some examples of the SED fitting for unobscured (upper pan- 
els) and obscured AGN (bottom panels) are shown in Figure [3] 
while the left panel of figure [4] shows the median SED chosen as 
best fit in the sed fitting for unobscured (blue) and obscured (red) 
AGN together with the 10th, 25th, 75th and 90th percentiles of 
the samples. Finally, the right panel of the same figure highlights 
the median AGN contribution to the global SED at different wave- 
lengths for unobscured (blue continuum line) and obscured (red 

5 In fact, due to an insufficient photometric coverage, 5 sources have been 
excluded from the analysis, while for 1 8 sources no good solution could be 
found due to problems in the photometry or in the SED: the reduced^ 2 in 
these cases is large (>20). For these sources no physical parameters could 
be derived. 



continuum line) AGN. Again the shaded areas and the long dashed 
lines mark the 10th, 25th, 75th and 90th percentiles of the sam- 
ples, respectively. From this figure, we see that, for obscured AGN, 
three quarters of the objects have AGN contribution in the optical 
bands smaller than 20%. Thus, any residual AGN contamination 
(or AGN over-subtraction), should not affect the determination of 
the rest-frame optical colors of the host by more than about 0.1 
magnitude. On the other hand, at rest-frame wavelengths of about 
1/jm, three quarters of type-1 AGN have AGN fractions smaller 
than about 50%, implying that, even for unobscured objects, our 
method allow a robust determination of the host total stellar masses 
(see also Merloni et al. 2010). 

Based on the best fit solution we derived the AGN luminosity 
and the galaxies' rest-frame magnitudes, colors, stellar mass con- 
tent and star formation rate. One sigma errors on the best fit param- 
eters are computed considering the range of values corresponding 
to the solutions for which A^ 2 = x 2 (sol) - x 2 (besi) < 1.0, corre - 
sponding to lcr in the case of one parameter of interest jAvnil 19761) . 

In some cases we are not able to derive a measurement for the 
galaxy or the AGN components but we can still define a meaningful 
upper limit on the unconstrained component. In unobscured type- 
1 AGN, if the rest-frame K-band luminosity of the host galaxy is 
smaller than 10% of the AGN luminosity in the same rest-frame 
band, we assign an upper limit to the host galaxy SED component. 
This is the case for 34 type-1 AGN for which the value of the upper 
limit of the galaxy K-band luminosity is taken as the highest possi- 
ble value of the parameter within the uncertainty. In these cases we 
assign an upper limit also to the object's total stellar mass assuming 
the median mass-to-light ratio of all other objects in the sample. No 
rest-frame magnitudes in U- and B-band are derived in these cases. 

On the other hand, for 279 obscured type-2 AGN which are 
not detected in the 24/jm Spitzer/MIPS band, we can provide only 
upper limits on the AGN component (and thus on the AGN lumi- 
nosity). This is because in heavily obscured sources the AGN com- 
ponent rises almost exclusively at NIR wavelengths, and without 
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Figure 5. Mixing Diagram from lHao et al\\201(\) : slope - slope plot of the entire sample of analyzed AGN. Circles correspond to XMM-COSMOS obscured 
(red) and unobscured (blue) AGN; green and yellow triangles show the optically selected AGN (selected using the BPT diagram and the NeV line). Open and 
filled circles correspond to photometric and spectroscopic redshifts. The big star show the R06 mean SED while the asterisks show different BC03 galaxy 
templates. The black line indicates the boundaries of the possible slopes obtained by mixing the R06 template with different fractions (0% - 100%) of a given 
galaxy template. The orange line shows the reddening vector of R06 with the triangles corresponding to steps of 0.1 in E(B-V). 



the 24yum detection it is not possible to determine reliably the AGN 
contribution to the optical/IR SED. For these sources, we derived 
an upper limit on the 12yum rest-frame AGN luminosity (see be- 
low) using the 24/jm detection limit and applying the appropriate 
k-correction. We note here that in these cases, we can still use the 
observed X-ray (or emission line) luminosity to infer the level of 
AGN strength in the source. 



Summarizing, from the initial sample of 1702 sources the SED 
fitting was successfully applied to 1679 sources. Of these objects, 
34 type-1 AGN have only an upper limit in the galaxy component 
parameters (mass and rest-frame M K ) while no U- and B-band rest 
frame magnitudes were assigned. For 279 type-2 AGN only an up- 
per limit in the AGN component parameters (L^j^ and thus L bol ) 
could be derived. All the derived parameters are available online. 
An example of the published table can be found in Table IC2l 



3.4 AGN-galaxy Mixing Diagram 

Based on our SED fitting procedure described in the previous sec- 
tion, Figure [5] shows the rest-frame NIR and optical slopes of 
the total best fi t SED of the sources in our sample (adapted from 
lHao et af]|201fjh . For each source we plot the slopes on either side 
of the rest-frame l^m which is approximately the inversion point in 
the typical QSO SED in a vf(v) versus v representation (i.e. where 
the exponent of a locally power-law fit to the SED changes sign). In 
particular, we computed the IR slope (cnir) between l/um and 3/jm 
and the o ptical slope (qtopt) between 0.3/im and 1/jm (for more 
details see lHao et aObOirjI l lEivis et al.ll2012l) . The location of the 
sources in this 'color-color' space is then shown in Fig. [5] where 
circles represent XMM-COSMOS obscured (red) and unobscured 
(blue) AGN, while triangles correspond to the optically selected 
sources without X-ray counterpart (green for AGN selected with 
the diagnostic diagrams and yellow for NeV selected AGN). Open 
and filled circles correspond to photometric and spectroscopic red- 
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Figure 6. Left: In the lower panel we show the stellar mass distribution of the AGN host galaxies for the different analyzed sub-samples: shaded blue and red 
for spectroscopically confirmed X-ray type-1 and type-2 AGN, empty cyan and red for X-ray selected type-1 and type-2 AGN with photometric redshift, 
dashed green and yellow for optically selected through emission lines (green: BPT diagram and yellow: NeV line) without X-ray detection. The top panel 
shows instead the entire sample (thick open histogram) and divided in type-1 (shaded blue) and type-2 (shaded red) AGN. Right: Host galaxy stellar masses 
versus redshift for the entire AGN sample with the usual color-coded sub-classes division. We also show in gray the distribution of masses for the I-band- 
apparent-magnitude-matched parent galaxy population. 



shift. Note that the same symbols and color-code is used throughout 
the paper. 

In the plot, the big star indicates the SED of our pure 
type-1 AGN templa te, for which a O pT=0.85 and a N1R =-0.49 
dRichards et~a7]|2006l . hereafter R06). Starting from this point, the 
orange line shows the R06 template when reddening is applied 
(the orange triangles correspond to steps in E(B-V) of 0.1). The 
asterisks show the position in this diagram of different galaxy 
templates derived from th e stellar population synthesis models of 
iBruzual & Charlotl j2003l hereafter BC03) adopted: from passive to 
extreme starbursts going from left to right. The black line indicates 
the R06 template when mixed with different fractions (0% - 100%) 
of a given galaxy template. This mixing curve defines the bound- 
aries of the possible slopes obtained by mixing the R06 with this 
particular galaxy template. This plot shows the three main com- 
ponents of the typical SED of the analyzed sources (AGN, host 
galaxy and extinction) and their relative contributions to the total 
SED and serves as a powerful illustration of the level of "contam- 
ination" of the optical/IR SED of AGN, at least at the luminos- 
ity levels probed by the COSMOS survey. As visible in figure, for 
type-1 unobscured AGN the galaxy contamination is less predom- 
inant than in type-2 obscured AGN, where high value of extinction 
are associated to high contamination of the host galaxy which can 
dominate in the 0.3//m - 3/jm interval. Moreover, there are no ob- 
jects along the AGN extinct line at E(B-V)>1 showing that when 
the AGN absorption is high, the galaxy contribution becomes im- 
portant. 

Most of the sources lie on the left-side region of the Q- pT-aNiR 
plot, and only a small percentage (and only obscured AGN) lie in 
the upper right corner. This is a consequence of the fact that in 
nature galaxies with high star formation rates typically show high 
levels of obscuration, too. This means that, while we do include 



non-extinct starburst templates in our library, these solutions are 
almost never chosen because unrealistic (unless t age or T/t age are 
very low, as is the case for the few points in that region of the plot). 

Unob scured AGN in t his region of the plot have been inter- 
preted by lHao et al" I f2oToh as hot-dust-poor (HDP) quasars i.e. 
normal quasar with a relatively weak IR bump. However, for the 
obscured ones, another possible explanation is that these sources 
are normal AGN hosted by starburst galaxies which steepen the 
UV slope. Looking one by one at the SED of all these sources 
we found that all but four of them are not detected at 24//m and 
their SED in the UV-optical part is dominated by a starburst galaxy. 
Most of these sources are Seyfert-2 galaxies optically selected us- 
ing BPT diagrams. Some of them, due to the errors associated to the 
line measurements, can be misclassified normal star forming galax- 
ies. Alternatively they could be very faint AGN hosted in starburst 
galaxies. 

3.5 Measuring AGN host stellar masses and star formation 
rates 

3.5.1 Stellar masses 

The SED fitting procedure allows us to estimate the total stellar 
mass and the star formation rate (SFR) of the AGN host galaxies. 
Knowing the IMF, for a given star-formation history, each combi- 
nation of t and f agc (see Sec. 13. lb is in fact uniquely associated to 
a value of specific star formation rate (sSFR=SFR/M, see Sec. |4.4| > 
and, given the normalization of the template, total stellar mass. 

For our AGN COSMOS sample, we have thus derived a robust 
estimate of the host galaxy stellar masses for 1650 objects (out of a 
total number of 1702 attempted SED fits). As explained in Sec. 13. II 
for 34 type-1 AGN we could derive only an upper limit on the 
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stellar mass, while for 18 objects the SED fit failed due to problems 
in the photometry or in the SED shape, so no mass estimation was 
possible. 

The top left panel of figure [6] shows the host galaxies stellar 
mass distribution for the entire AGN sample (open histogram) and 
divided in type-1 (hatched blue) and type-2 (hatched red) objects. 
The bottom panel shows a detail of the M, distribution in the differ- 
ent sub-samples, color-coded as usual. Masses range mainly from 
10 10 M o to 10" 5 M with apeak at 10 10 9 M o . No significant differ- 
ence is found between the type-1 and type-2 AGN host population 
which show on average the same host galaxy mass distribution. In- 
terestingly, the hosts of optically selected Seyfert-2 galaxies with- 
out X-ray counterpart show a tail in the distribution at lower masses 
down to 10 85 M Q . 

The right panel of Fig. [6] shows instead the stellar masses of 
the AGN hosts as a function of redshift superimposed on the under- 
lying distributio n of stellar masses of non-AGN in the COSMOS 
area drawn from lllbert et al.l l l201(j|) and matched to the AGN pop- 
ulation in redshift and I-band apparent magnitude (see Sec. l2.4t . 

3.5.2 Star formation rate 

The measure of the host galaxy SFR is subject to considerably 
larger uncertainties than that of stellar mass, in particular for type-1 
AGN. In general, the most reliable way to compute the SFR is sum- 
ming up the rate of unobscured star formation (emitted in the UV) 
with the obscured one re-emitted in the (mid- and far-) IR by dust 
(SFRuv+ir)- Our optical/IR SED fitting procedure relies only on the 
UV emission measurements, which traces only the unobscured SF, 
which is however scaled up by the dust correction factor computed 
using the full SED to account for the latter contribution. 

The issue of the reliability of UV-based indicators of star for- 
mation rates in galaxies is a long-standing one. With the advent of 
the Herschel telescope, sensitive Far-IR measurements of cold dust 
heated by star-forming processes has become available for deep 
and medium-deep surveys of the extragalactic sky, providing a first 
direct handle of SFR U V+IR in large samples of galaxies (see e.g. 
iRodighiero et alj|20ld) . For complete samples of AGN selected in 
both X-rays and optical spectroscopic surveys, however, the final 
answer to the question of what are the intrinsic star-forming prop- 
erties of the AGN hosts requires SFR indicators that are sensitive to 
both star-forming galaxies and passive/quiescent ones. This is not 
possible even for Herschel, whose sensitivity to individual galaxies 
does not reach into the quiescent population at the reds hifts (z > 1) 
of deep extragalactic survey s such as COSMOS (see e.g. lShao et ai] 
l20ld : lMullanev et alj2012[) . Indeed, only ~ 10% of the AGN in our 
sample are detected above the 3cr limits of the PEP catalogues in 
the COSMOS field. In Appendix|B]we compare the SFR estimated 
from the SED fitting with the one derived with other methods (i.e. 
FIR observation and optical emission lines). We find that above 
~20 Mq yr~' the agreement between the SFR derived from the 
SED and the one derived from the FIR bands is quite good, while 
at lower SFR (< 20Mq yr -1 ) the disagreement becomes evident, 
with SFR(FIR) being systematically higher than SFR(SED). We 
interpreted this discrepancy as a combination of a possible contam- 
ination from SF in the NIR not properly taken into account in the 
SED fitting together with a not-negligible AGN contamination to 
the FIR bands not taken into account in the FIR SFR estimates(see 
e.g. lRosario et al.|[2012l) . This would imply that, while our SFR es- 
timates can be in some cases underestimated, the FIR-SFR can be 
overestimated especially at lower SFR where the AGN contamina- 
tion would represent a significant fraction of the IR emission. 



In any case, the very nature of our objects, and the SED de- 
composition technique with which we analyze them, require some 
special care in the interpretation of the SFR measurements. The 
main problem with type-1 (unobscured) AGN is that the accretion 
disk emission strongly contributes in the UV band (see upper pan- 
els of Fig. [3} introducing a degeneracy in the SED fitting between 
the UV emission from star formation and from the central AGN. In 
practical terms this means that, in some cases the SED fitting can 
give two significantly different solutions with a similar^ 2 : an un- 
obscured AGN, with a prominent Big Blue Bump (BBB) dominat- 
ing in the UV range with a passive/moderately star forming galaxy 
or, conversely, a moderately obscured AGN with a strongly star- 
forming galaxy, which substantially contributes to the observed UV 
emission. Since it is not possible to disentangle a priori this kind of 
degeneracy, we will not take into consideration the SFR measure- 
ments of type-1 AGN for the further discussion. 

On the contrary, since in obscured AGN the UV emission is 
suppressed by obscuration (see, as an example, the bottom panels 
of Fig. O, the UV range is clean from AGN contamination and a 
reliable estimate of the SFR can be derived from the SED. 

Figure [7] shows the host galaxy SFR distribution in three red- 
shift bins for 1090 obscured type-2 AGN (thick solid line) and 
for the different sub-samples with the usual color code. AGN 
host galaxies span a very wide range of SFR, with X-ray selected 
AGN populating the entire range, while optically selected Seyfert- 
2 galaxies (which are mainly at low redshifts) are hosted predomi- 
nantly in galaxies with low SFR (but very low masses as shown in 
Fig. [6} . This can highlight a physical difference in the hosts of opti- 
cally selected Seyfert-2 galaxies or can be, at least partly, driven by 
the fact that high SFR implies strong emission line dilution which 
causes a certain fraction of AGN to be missed by the emission line 
ratios (BPT) selection. 



3.6 Measuring nuclear L 12 ,,„, and bolometric luminosities 

Before proceeding to a comprehensive discussion of the observable 
(and physical) properties of the host galaxies of the AGN selected 
in COSMOS, we present some basic "intrinsic" properties of the 
AGN population itself. This will be also a useful reminder that, in 
terms of the physical characteristics of growing black holes and of 
their immediate (nuclear) environments, every large survey selects 
families of objects occupying specific parts of the generic param- 
eter space identified by the distance of the source, its accretion lu- 
minosity, BH mass, and level of nuclear obscuration. 

To begin with, we note that from the best fitting (obscured 
or un-obscured) AGN component in the SED, we can derive an 
estimate of the AGN luminosity at (rest-frame) 12/jm which can 
be used as a reliable estimator of the AGN bolometric luminosity. 
The intrinsic AGN 12yum luminosity is calculated by integrating 
the AGN component obtained from the best SED fit over a nar- 
row (i.e., lyum) pass-band centered on 12^m. By construction, and 
given the shape of the BC03 templates used to fit the galaxy compo- 
nent in every system, the mid-IR band is assumed to be completely 
dominated by the nuclear AGN light reprocessed in the obscuring 
molecular structure on ~ 1 - 10 pc scale. Such a drastic assump- 
tion is likely not to be correct for a number of individual objects, 
which may have a significant galaxy contribution (not included in 
BC03 templates) at 12/jm. However, globally it is not unreasonable 
for the studied AGN sample as suggested in Fig. [8] where the AGN 
12yum luminosity is plotted against the intrinsic (de-absorbed) X- 




Figure 7. Star formation rate distributions in three redshift bins of the AGN host galaxies for the entire type-2 AGN sample (thick open histogram) and 
divided in different sub-samples: shadow and empty red for X-ray spectroscopically confirmed and with only photometric redshift type-2 AGN, dashed green 
and yellow for optically selected through emission lines (green: BPT diagram and yellow: NeV line) without X-ray detection. The gray histogram shows as 
comparison the distribution of the mass-matched parent sample. To make easier the comparison, the latter has been divided by a factor of two. 
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Figure 8. X-ray [2-10] keV versus mid-infrared i2/im luminosities for the 
sample of ~ 1550 AGN with X-ray detection and measured Li2, lm . Red cir- 
cles are spectroscopic (filled) and photometric (open) type-2 AGN while 
type-1 AGN are represented with filled (spectroscopic) and open (photo- 
metric) blue circles. The thin solid green diagona l line i s the [2-10]keV - 
12.3//m luminosity relation found by lGandhi et al.| j2009|) for the sample of 
22 well resolved local Seyfert. The thick solid lines are the fitted correlation 
considering our entire sample (black) and only the spectroscopic sources 
(gray), while the long-dashed blue and the short-dashed red correspond to 
the type-1 and type-2 sub-sample. 



Figure 9. Histograms of the r atio between the mea sured 12.3 fim luminosity 
and the one predicted by the iGandhi et al.[ j2009t) relation on the basis of 
the observed 2-10 keV X-ray luminosity. In the top panel we show type-2 
AGN, with the red histogram marking objects in the spectroscopic sample 
and the purple one those with photometric redshifts; filled histograms are 
for upper limits. In the bottom panel we show instead the type-1 AGN, 
with the dark histogram marking objects in the spectroscopic sample and 
the cyan one those with photometric redshifts. Yellow histograms in both 
panels show the outliers in the mixing diagram of Fig.|3](see text for more 
details). The two solid orange lines delimit 2cr from the Gandhi relation 
(mean equal to zero and standard deviation cr=0.23). 
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Figure 10. Upper Panels: Mid-IR and bolometric luminosity distribution of 
the entire sample (thick open histogram) and for the type-1 (shaded blue) 
and type-2 (shaded red) AGN sub-samples separately. Bottom Panels: Mid- 
IR and bolometric luminosity distribution of the different sub-samples in- 
cluded in our AGN sample: shaded blue and red for X-ray spectroscopically 
confirmed type-1 and type-2 AGN, empty cyan and red for X-ray selected 
type-1 and type-2 AGN with photometric redshifts, dashed green and yel- 
low for optically selected through emission lines (green: BPT diagram and 
yellow: NeV line) without X-ray detection. 

ray [2-10] keV luminosit^ for the XMM selected sample of -1550 
objects with X-ray detections and Li 2/im measurements. 

Using a sample of local Seyfe rt galaxies (both Se yfert-1 and 
Seyfert-2) at high spatial resolution. lGandhi et alj d2009l) found that 
the uncontaminated nuclear mid-IR continuum of AGN closely cor- 
relates with the X-ray [2-10] keV AGN emitted powers over three 
orders of magnitude in luminosity. To be specific, the best-fit corre- 
lation they obtained considering only the 22 well-resolved sources 
is shown as a thin green line in Fig. [8](L0g(L MIR /lO 43 ) = (0. 19 ± 
0.05) + (1.11 + 0.07)Log(L x /10 43 )). Our data follow the Gandhi's 
relation. As a comparison, the fit to our data performed adopt- 
ing the ordinary least-square bisector estimate recommended by 
llsobe et alj Jl990h . which gives identical weighting to all sources, 
gives Log(L M i R /10 43 ) = (0.10±0.03) + (1.24±0.03)Log(L x /10 43 ) 
for the entire sample and Log(L MIR /10 43 ) = (0.29 ± 0.03) + (1.13 ± 
0.03)Log(Lx/10 43 ) if we consider only the spectroscopic sample. 
The fits are shown respectively as black and gray solid thick lines 
in Fig. [8] together with the fit for the type-1 (blue long-dashed line) 
and type-2 (red short-dashed) AGN sub-samples. The good agree- 
ment between the predicted and the measured 12.3yum luminosities 
is also shown in Fig. [9] where we show the histogram of the ratio 
betwe en the measured 12 .3 fim luminosity and the one predicted 
by the lGandhi etafl d2009h relation for type2 (top panel) and typel 
AGN (bottom panel). We notice that, while the distribution of the 

6 Calculated on the basis of the observed fluxes in the soft (0.5-2 keV) and 
hard (2-10 kev) X-ray energy bands, of the known redshift, and assuming, 
as X-ray spectral model, an intrinsic power-law of slope T = 1.9 ± 0.2, 
absorbed by neutral gas. 



ratio between the measured and the predicted MIR luminosity is 
centered at z errfl the obse r ved sc atter is much larger than the one 
measured by iGandhi et alj j2009h (2cr from the Gandhi relation is 
delimited with orange lines: mean equal to zero and standard de- 
viation cr=0.23). If we consider only the sources with a spectro- 
scopic redshift (black solid line), we do see a clear asymmetry in 
the distributions, as expected due to the un-accounted contribution 
of star-formation-related emission in the NIR (see also Lusso et 
al. 2011). Indeed, while ~61% of the objects are found within 2cr 
of the distribution, the tail extending to lower values includes 94 
sources (12%), while for a higher number of sources (211, 27% of 
the sample) the observed mid-infrared luminosities are significantly 
higher than the predicted ones. This is however not the case for the 
photometric sample. A possible explanation of this effect might in- 
volve the effect of the uncertainties in the photometric classifica- 
tion in some of the AGN for which we miss spectroscopic informa- 
tion. Indeed, we have tested that all the small fraction of outliers 
in the diagram of Fig. [5] (i.e. type-1 AGN with ccopt < -1 and 
type-2 AGN with ccopt > show a broader distribution in the 
ii2.3/im,obs/ii2.3^m,pred ratio with a slight skew towards values less 
than unity. 

Part of the observed differences can also be attributed to the 
fact that we are extending the Gandhi relation, which was derived 
for a sample of local Seyfert galaxies, to a sample spanning a much 
wider luminosity and redshift range. However, it is also plausible 
that in a fraction of these sources the SED-fitting procedure over- 
estimates the nuclear contribution resulting in a higher Li2.3,,m,obs 
compared to the predicted L 12 .3^ m , pre d- How much of this scatter 
is inherently due to the physical conditions of the AGN and torus 
clouds, as compared to observational selection effects, remains an 
important unresolved issue, which is beyond the scope of this pa- 
per. However, the comparison points out that on average the 12.3/jm 
luminosity derived from the SED fitting is a reasonably good mea- 
sure of the mid-IR AGN luminosity and that indeed the assumption 
that the mid-IR emission is dominated by the AGN emission due to 
accretion onto the central black hole rather than SF from the host 
galaxy is plausible for most of the sources. 

One clear advantage of adopting the rest-frame MIR lumi- 
nosity, Li2^ m , as a universal AGN power estimator for our sam- 
ple is that, contrary to the [2-10] keV luminosity, this quantity was 
measured homogeneously for all the sources, including the objects 
with no X-ray emission. Hence it can be used to compare the AGN 
power in the different sub-samples and to derive in a homogeneous 
way the bolometric luminosity for the entire sampl^E This is shown 
in Fig. 1 101 where both the 12/im and t he derived bolometric lumi- 
nosity using the iHopkins et al] | |2007|) conversions are shown for 
the different sub-samples (bottom panels) and for the full sample 
divided in obscured and unobscured AGN (upper panels). Bolo- 
metric luminosities range from 10 43 to 10 47 erg s" 1 . 

Optically selected Seyfert-2 galaxies (green histogram) are 
among the faintest sources in our sample extending down to 
Lboi=10 43 erg s" 1 . Since their 12//m luminosities are of the same 
order of the X-ray selected obscured sources, the lack of X-ray de- 
tection can be due to the fact that these sources lie in the tail of 
the Gandhi relation (Fig. [9}. Alternatively, some of these sources 



7 in contrast to lLusso et al.U20lH) which found a shift of 0.2 towards higher 
values when considering the ratio between the total (AGN+GAL) SED and 
the Gandhi predicted values. 

8 However, for a more detailed study of t he bolometric luminositie s of 
XMM-COSMOS AGN we refer the reader to lLusso et al] j201Cj|201lh . 
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can be Compton-thick AGN, as shown bv lGilli etalJfeOld) for the 
[Nev] -based sample, and, due to the high N H absorption, they are 
not detected in the X-ray band. 



4 THE PHYSICAL PROPERTIES OF COSMOS AGN 
HOST GALAXIES 

4.1 Rest frame colors 

The unique strength of the COSMOS AGN sample at hand, lies in- 
deed with the possibility, by means of our SED fitting technique, of 
reliably removing the AGN component from the overall SED of any 
AGN host. In this section, we thus discuss rest-frame observables 
(absolute magnitudes and colors) of the AGN host galaxy compo- 
nent, which are, by construction, little contaminated by the AGN 
colors. We also investigate the properties of the AGN host galaxies 
compared to normal galaxies. 

Figure QT] shows the rest frame U-B colors versus the B- 
band absolute magnitude (the color-magnitude diagram, CMD) 
for our sample of AGN hosts divided in three different redshift 
bins ([0.3<z«0.8], [0.8<z<1.5] and [1.5<z<2.5]). The gray con- 
tours show for co mparison the whole galaxy sample drawn from 
lllbert et aljfcoioh in the same redshift intervals, while grey points 
correspond to the mass-matched parent sample described in Sec. 
12.41 We have chosen to show both the 'measured' values of the 
U-B colors (upper row) and the extinction corrected ones, where 
the value of the extinction is that obtained for the galaxy compo- 
nent from the best fit SED decomposition (see section [3.2,lb . This 
is an important test, as galaxies with red colors can be either dusty 
star-forming galaxies or galaxies with old stellar populations, and 
any interpretation of the location of AGN hosts in the CMD may 
be strongly a ffected by the treatment of dust extinction in these sys- 
tems (see e.g. lCardamone et alj2010l) . Our rich and comprehensive 
multi-wavelength dataset is very well suited to attempt a robust de- 
termination of the large scale extinction properties of the galaxies 
in the various samples. 

First of all, a note of caution is in place, as, at any given host 
galaxy magnitude, the colors of the type-1 hosts are more and more 
unreliable the higher the nuclear AGN luminosity. In this respect, 
very blue colors of bright type-1 AGN (see, in particular, the clus- 
ters of sources at U-B^O.1-0.3 in all extinction corrected CMD) 
should not be too trustworthy, as they most likely indicate objects 
for which the SED decomposition, and thus the AGN blue light 
subtraction, may give uncertain and possibly unreliable results. We 
will come back to this point when discussing the inferred star for- 
mation rate properties of the AGN hosts. 

We now move to the discussion of the CMD of the general 
AGN hosts population. Underneath each of the diagrams, Fig. [TT1 
shows the fraction of all AGN (black), type-1 AGN (blue) and 
type-2 AGN (red) relatively to the whole galaxy population in the 
corresponding redshift bin as a function of B-band magnitude. For 
consistency, in the computation of these fractions we restrict the 
analysis to the X-ray detected sample, the completeness of which 
can be properly assessed in a relatively simple way (see e.g. Sec 14. 31 
below), noticing however that the only effect of the pure optical 
sources is to slightly flatte n the slope at lower redshifts. In agree - 
ment with previous studies dNandra et al.l2007luBrusa et al. 2009b), 
the AGN are almost exclusively hosted in bright (M B < -20) galax- 
ies and the fraction of galaxies hosting AGN increases going to 
higher luminosities. 

No significant difference is found in the host's colors of 
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unobscured and obscured AGN, in contrast with what expected 
from the most popul ar models of AGN fueling and feedback (e.g. 
iHopkins etaT]|2008h but in agreement with the standard unified 
model l lAntonuccil 19931) . As a matter of fact, the relative fraction of 
type-1 and type-2 AGN changes only slightly with B-band lumi- 
nosity in each redshift interval, but the overall fractions of obscured 
and unobscured AGN are markedly different in the three redshift 
bins. This is an effect introduced by the well known anti-c orrelation 
betwe en obscured AGN fraction and luminosity (see e.g. lHasinged 
120081) in combination with the almost constant (X-ray) flux limit 
of the XMM-COSMOS survey, which implies that high redshift 
sources are more luminous than low redshift ones. A comprehen- 
sive discussion of the incidence of obscured and un-obscured AGN 
in the COSMOS field as a function of luminosity, redshift and host 
galaxy properties will be presented in a separate work (Merloni et 
al. in prep.). 

We find no evidence of color bi-modality for AGN hosts, 
which populates mainly the red and intermediate part of the bi- 
modal color magnitude distribution observed for normal galaxies. 
Interestingly, at 0.3< z <0.8 the low luminosity, optically selected 
(from BPT diagram) Seyfert-2 galaxies without X-ray counterpart 
(green triangles) seem to behave in a different way, being mainly 
hosted in galaxies populating the blue and the intermediate region. 
The effects of dust extinction correction on the AGN distribution in 
the color-magnitude diagram are clearly visible in the lower panels 
of Fig.QT] where some objects populating the red sequence moved 
to the intermediate U-B region and some that w ere in the interme 
diate region lie now in the blue region (see also 
2010). However, in contrast to what was found by 
(2010), no clear bi-modality is seen in any of the three panels even 
after dust correction is applied. More importantly, it is clear from 
Fig. QT] that dust extinction correction affects more severely the 
overall color distribution of the parent galaxy sample. This can be 
seen in the fact that the AGN fraction in the B-band most luminous 
galaxies in the sample decreases once dust correction is included. 

Indeed, at any redshift, AGN hosts are found mainly in red 
galaxies, albeit with a substantial tail towards intermediate and 
bluer colors (see also Fig.l 12t. In many previous studies, AGN were 
described as populating preferentially an interm ediate region, the 
so called "green valley" dSilverman et al .1120090 . with the possible 
interpretation that they live in quenched galaxies going through the 
transition phase between active and passive star-formation phase. 
However, most of these works were focused on small subsample 
of the AGN population and, more important, the AGN component 
was not subtracted in the computation of the host colors leading to 
a possible blue contamination from the AGN which in fact would 
push red hosts into the green valley. Moreover, the net effect of 
the dust correction applied systematically, and in a uniform way, to 
both the parent and the AGN host galaxy samples is to decrease the 
fraction of AGN in the most star-forming objects. 

In general, the often contrasting interpretations of many previ- 
ous stud ies of the CMD of AGN hosts from deep multi-wavelength 
surveys dNandra et alj20q7l:lsilverman et al.l2009llXue et alj2010l : 
ICardamone et alj l20ld : iRosario et alj 1201 If) demonstrates that 
studying their position within optical color-magnitude diagrams is 
hardly a robust way to unveil the star-forming nature of AGN hosts. 
This is because the interpretation of optical colors in terms of star- 
formation properties of a galaxy can suffer from various, degen- 
erate, biases, especially when dealing with distant AGN hosts. To 
counter such degeneracies, at least partly, in section |4T4l we will dis- 
cuss in more detail how AGN hosts are distributed in the physical 
parameter space of stellar mass and star formation rate. 
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Figure 11. Upper Panels: U— B vs Mb color-magnitude-diagram AGN host galaxies of unobscured (blue circles) and obscured (red circle) X-ray AGN and 
optically selected (green and yellow triangles) AG N in three redshift b ins. Open and filled circles correspond to photometric and spectroscopic redshift. Grey 
contours show the whole galaxy sample drawn from lllbert et al.l J"20 1 Oh while grey points correspond to the mass-matched parent sample described in Sec. 12.41 
Lower Panels: Same as above but now Mb and U— B are de-reddened using the E(B-V) derived from the SED fit. The bottom part of each panel shows the 
fraction of galaxies with an AGN as a function of B magnitude, considering all AGN (black circles), only type-1 (blue squares) and only type-2 (red triangles) 
AGN. For these fractions, we consider only X-ray selected AGN and the whole galaxy sample. 



4.2 Host galaxy color-mass diagrams 

Figure[72lshows the position of our AGN sources in the color-mass 
diagram, where the colors (U-V) are corrected for extin ction. Black 
contou rs show the whole galaxy sample drawn from lllbert et alj 
while grey points correspond to the i-band-apparent- 
magnitude-matched parent sample described in Sec. 12.41 Compar- 
ing with the whole galaxy population, we find that, at any redshift, 
AGN are preferentially detected in massive galaxies, in agreement 
with previous studies terusa et alj l2009bl ; Isiiverman et alj|2009l ; 
IXue et alJl20ld : lMullanev et alj|2012h . The trend is still present if 
we use as comparison sample the i-band-matched sample. In the 
next section we will investigate in greater detail the evolution of 



the AGN fraction as a function of redshift, X-ray luminosity, stel- 
lar mass of the host, and of the ratio of these two latter quantities. 
Here we note, however, that except for few unobscured AGN in 
the intermediate z bin showing extreme (and probably unreliable) 
blue colors, obscured and unobscured AGN overlap very well in 
the color-mass plane. 

Since massive hosts tend to be redder, the trend with mass and 
the one with colors seen in the previous section should go in paral- 
lel. In many previous studies it has been pointed out how, removing 
the mass dependency i.e. using a mass-matched control sample, 



the trend with color disappears dSilverman et all 1 20091 ; IXue et all 
120101 : iMainieri et alj|201 ll : iRosario et al.ll201 ifT However, this is 
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Figure 12. Extinction corrected color (U-V) vs mass diagram of AGN host galaxies of unobscured (blue circles) and obscured (red circle) X-ray AGN and 
optically selected (green and yellow triangles) AG N in three redshift b ins. Open and filled circles correspond to photometric and spectroscopic redshift. Grey 
contours show the whole galaxy sample drawn from lllbert et all l2010l) while grey points correspond to the i-band-apparent-magnitude-matched parent sample 
described in Sec. 12.41 



not the case in our study. Fig [13] shows the extinction corrected 
U-B color histogram for AGN hosts in the usual three redshift 
bins, for three different ranges of stellar masses, all above the com- 
pleteness threshold of the Ilbert et al. (2010) parent galaxy sample, 
divided into unobscured (shaded blue) and obscured (shaded red) 
AGN and compared to the mass-matched galaxy sample. Galaxies 
hosting an AGN tend to be redder than normal galaxies of compa- 
rable stellar mass (the median color shift ranges between ~ 0. 1 and 
~ 0.25). 

Following our discussion in the previous section, we can spec- 
ulate about two main reasons for the discrepancy between our color 
distributions of AGN hosts in mass-matched samples and previous 
studies. First of all, our SED decomposition technique allow a more 
reliable correction for AGN light "pollution" that would typically 
move the color of an AGN host towards the blue, star-forming se- 
quence. Second, thanks to the unique multi-wavelength coverage 
of the COSMOS field, we are able to extinction-correct, in a sys- 
tematic way, all galaxy colors, both in the parent and in the AGN 
host samples. We find that moving dusty star-forming galaxies back 
to the blue cloud with such a correction effectively decreases the 
incidence of AGN among star-forming galaxy, while, conversely, 
the relative fraction of AGN hosts in truly passive galaxies is en- 
hanced. We caution however that, being these color shifts relatively 
small, systematic effects due to a non-optimal AGN-galaxy decom- 
position (AGN over-subtraction, see section [3~3l or to the specific 
choice of stellar population templates, could still be held responsi- 
ble for some of this observed trend. 



4.3 The incidence of AGN as a function of the host galaxies' 
stellar masses 

Let us now move to a quantitative assessment of the fraction of 
AGN as a function of host stellar mass. In order to do so, we need to 
take carefully into account the selection function of our survey. For 
this reason, in this section, we will consider only the X-ray selected 



AGN, taking advantage f rom the well defined ex posure map of the 
XMM-COSMOS survey dCappelluti etai1l2009l) . 

So far, one of the clearest pieces of evidence emerging from 
a variety of comparative studies of AGN and host galaxies in large 
multi-wavelength surveys is the fact that the likelihood of finding 
an AGN is a very strong function of the galaxy stellar mass, with 
AGN fractions (or, equivalently, duty cycles) increasing with in- 
creasing host galaxy stell ar mass. This is a particularly stro ng effect 
for radio-selected AGN teest et alj|2005l ; lsmolcicll2009h, but has 
been confirmed also for X-ray select ed AGN terusa et alj|2009bl : 
IXue et alj|2O10l ; lMullanev et alj|2012l) . 



4.3.1 AGN fraction as a function of X-ray luminosity 

First of all, we compute the probability to find an AGN within a 
logarithmic bin of a given X-ray (2-10 keV, absorption corrected) 
luminosity, Lx, in different bins of stellar mass by taking the ratio of 
the number of AGN with measured host's stellar mass in that range 
divided by the number of galaxies (of all morphological, color and 
star-formation activity types) having stell ar mass in the same range 
in the parent sample of lllbert et all d2010h : 



Pagn(Lx\M„z) ■ 



dN AGN (z) dlogM, 
dlogM,dlogL x rfA'gaife) 



(6) 



In practice, the sensitivity limits of the XM M-Newton obser- 
vation s vary spatially across the field (see Fig. 5 of lCappelluti et al] 
120091) . hence to derive the fraction of galaxies hosting an AGN as a 
function of the stellar mass, we followed the technique discus sed in 
Sec. 3.1 of lLehmer etai1l2007l (see also lMainieri et alj|201 ll) . The 
Pagn(Lx\M,,z) for each stellar mass and luminosity bin is deter- 
mined by the following equation: 



dN AGN (L x \M t ,z) 
dN gll \(M„z)dLogL x 



(7) 
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Figure 13. Rest frame extinction-corrected U-B color (normalized) distribution of AGN host galaxies in three redshift (from left to right) and mass (from 
bottom to top) bins. The black solid line corresponds to the total sample which is further divided into unobscured (blue shaded histogram) and obscured (red 
shaded histogram) AGN while the filled gray histogram corresponds to the galaxy sample. For e ach redshift interv al we have considered only masses above 
which the parent sample is complete in stellar mass for both passive and star forming galaxies Albert et al1l2010l) . In each panel the median shift in color 
AC a <(t/ - B)agn) - ((U - B) g ai) is also reported. 



where dNf^{Lx\M„ z) is the observed number of AGN in each bin 
of X-ray luminosity, stellar mass and redshift and we have defined: 

j 'Wagn 

dN^M t ,z) = — — — — — - V Ar i (i x , i |M»,z). (8) 

dN AGN (L x \M,,z) jrf 

Nj(Lx\M t ,z) is the number of galaxies that could have been 
detected as X-ray sources at their position (given the known expo- 
sure map of the XMM-COSMOS survey) if they had the luminosity 
Lx,i of the rth AGN in the given bin. 

Figure [14] shows the probability of galaxies hosting AGN, 
Pagni f° r different stellar mass bins in three redshift intervals. For 
lower mass points, completeness has been treated conservatively, 
i.e. we have assumed that the parent galaxy sample is complete only 



above the mass completeness li mit for quiescent g alaxies as func- 
tion of redshift, as reported in lllbert et alj J201CI> (this translates 
into mass limits completeness of about M, = 10 97 , 10 10 * 1 , 10 10 - 7 
for the three redshift bins, respectively). In those cases, our AGN 
fractions are considered as upper limits. 

The good statistics of our large, homogeneous and complete 
sample, allows for an accurate study of AGN incidence, which ac- 
coun ts for both redshi ft and intrinsic AGN luminosity dependence 
(see lAird et al]|2012h . In each panel, AGN with different intrin- 
sic X-ray luminosity are plotted with different colors. While the 
global trend with masses remains qualitatively the same in each lu- 
minosity bin, the normalization strongly decreases with luminosity, 
i.e. for a galaxy of a given mass the probability to host an AGN is 
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Figure 14. Probability of a galaxy to host an AGN of a given luminosity as a function of stellar mass in three redshift bins. Different colors show dif- 
ferent X-ray luminosity bins: green for log(L[2-iofa>y])=[42.8-43.5]; blue for log(Lp-io*eV])=[43.5-44]; purple for log(L[2-iofa>v])=[44-44.5] and red for 
log(L[2-ioicV])=[44.5-46]; the black points refer to the total fraction computed by summing all objects above the given luminosity thresholds. Upper lim- 
its correspond to the mass bins where the host galaxy parent sample is not complete, as determined by the completeness limits for quiescent galaxies in 
COSMOS reported in lllbert et al] feOld) . The thin dot-dashed line is t he prediction for th e total AGN fraction, computed in the same redshift bins, and above 
the same luminosity thresholds, from the "AGN occupation" model offfiore et al. 1 20 lj) while the dotted line in the first redshift bin is a modified version of 
it (see text for more details). 



higher as the AGN X-ray luminosity decreases and the same hap- 
pens at any redshift. This is an obvious consequence of the shape of 
the AGN luminosity function: more luminous AGN are rare, while 
less luminous ones are much more common. 

Very interestingly, as already shown bv lAird et al] d2012l) . the 
Pagn(^x|A^«,z) as a function of galaxy stellar mass do show very 
similar profiles at all sampled luminosities. AGN, of any intrinsic 
luminosity, are more common the more massive the host galaxy 
is. This is by no means a trivial fact: one could have easily ex- 
pected that more luminous AGN were to be found in more massive 
galaxies and less luminous AGN in lower mass galaxies. This, in 
fact, is the prediction of any model in which black hole growth in 
AGN phases occurs over relatively narrow Eddington ratio ranges, 
and the very different picture we observe in Fig. [14] (as well as in 
Fig. Q3] see below) is a strong indication that AGN of all masses 
should be cha racterized by a very bro ad distribution of Eddington 
ratios (see e.g. lMerloni & Heinzll2008l) . We quantify this statement 
by studying the AGN fractions as a function of the specific accre- 
tion rate in section |4.3.2| below. 

Figure [T4l also shows (black points) the total AGN fraction 
above a given luminosity thresholds in each redshift interval (i.e. 
computed by summing up the p A gn(Lx\M,,z) in different luminos- 
ity bins). As a comparison, the thin dot-dashed line is the predic- 
tion for the total AGN fraction, computed in the same redshift bins 
and above the same luminosity th resholds, from the "AGN occupa- 
tion" model o f lFiore et al.N2012l) . We refer the reader to that work 
for more details, but we note here that in iFiore et al] J2012h the 
AGN fraction is computed by deriving an active SMBH mass func- 
tion from the observed AGN luminosity function, assuming a given 
shape for the Eddington ratio distribution, and then comparing it to 
the observed galaxy mass function. In the two highest redshift bins, 



such a model reproduces our data remarkably well. However, se- 
vere discrepancies appear in our lowest redshift range. As we will 
show in the next section, it appears that the true Eddington ratio 
distribution of our AGN is better descr ibed by a simple p ower-law 
and the fact that the shape assumed in iFioreetal] J2012T) is a log- 
normal can explain the discrepancy. In fact, while in the highest 
redshift bins, where our COSMOS AGN sample probes just a lim- 
ited range of Eddington ratios, the observed distribution is hardly 
distinguishable from a log-normal distribution with appropriate pa- 
rameters, in the lowest red shift range, the log-normal shape as- 
sumed bv lFiore et al] {2012;) under- predicts the number of low lu- 
minosity AGN hosted in massive galaxies (10 106 < M, < 10 116 ). 
Assuming a log-normal distribution with a peak at lower Edding- 
ton ratio would already improve the prediction in the first redshift 
bin as shown by the dotted line in Fig. [14] which corresponds to a 
distribution with /l pcll k=0.03. 



4.3.2 AGN fraction as a function of specific accretion rate 

Indeed, by combining the available information (host galaxy stel- 
lar mass and AGN luminosity) we can study the probability of a 
galaxy to host an AGN as a function of mass in bins of "specific 
accretion rate", i.e., the ratio Lx/M, which gives an approximate 
estimate of the rate at which the SMBH grows. To the extent that 
a proportionality between the black hole mass and the (total) host 
galaxy stellar mass can be a ssumed (and this is far from being clear, 
see e.g. iMerloni et alfeoiOl) . this ratio could give a rough measure 
of the black hole Eddington ratio, defined as the ratio between the 
AGN X-ray luminosity and the Eddington luminosity. To be spe- 
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Figure 15. Probability of a galaxies to host an AGN per decade specific accretion rate (Lx/M,) as a function of stellar mass in three redshift bins. Different 
colors show different Lx/M, intervals, as explained in the third panel. Empty symbols with arrows show upper and lower limits due to the combined effec t 
of completeness in both L\ and M,. Upper limits without empty circles mark the bins where no AGN was found, and were placed using the Gehrels ( 1986) 
formulae for Poisson statistics. 



cific, the Eddington ratio can be expressed as: 



X 



1.3 x 10 38 M 



(9) 



where k ba] is the 2-10 keV bolometric correction, and the factor 31 
is a constant if the SMBH mass can be related to t he host galaxy 
mass through scaling relation s (withal ss 500-1000; lHaring & Rixl 
l2004lMagorrian et alJI 1998h . Thus, for a mean bolometric correc- 
tiorj/j of k bol = 25 and a constant host stellar to black hole mass 
ratio of 3\ = 500, a ratio of L x /M» = 10 34 would approximately 
correspond to the Eddington limit. 

Figure [T5l shows how the probability p A G^(Lx/M,\M,,z) of 
a galaxy to host an AGN of a given specific accretion rate scales 
with the stellar mass of the host. Here, of course, the issue of com- 
pleteness is far more complicated. Depending on the Lx/M, ratio 
interval considered, both the flux limit of the X-ray survey and the 
stellar mass completeness limit of the parent galaxy sample should 
all be taken into account. Figure [T5l shows that, at any given spe- 
cific accretion rate (a proxy, with all the caveats discussed above, 
of the Eddington ratio), the trend with mass almost disappears: the 
probability for a galaxy to host an AGN of a given Lx/M, is almost 
independent of its mass but scales only with the ratio itself. 

In Figure[l6] we then plot p AGN (L x / M,\M„ z) as a function of 
the specific accretion rate itself. As mentioned above, there is little 
dispersion among the points corresponding to different mass bins 
for the same Lx/M, interval. We can assume that the AGN fraction 
at any given value of the Lx/M, ratio is independent of host galaxy 
mass, and measure such a constant "universal" mean probability 
for every specific accretion rate and redshift bin. The thick black 
triangles in Fig. 1 161 show the mean values of p A Q^(Lx/M,\M,,z)- 



9 recent works iLusso et alj|2010t |201l"l) have shown that the 2-10 keV 
bolometric correction is likely an increasing function of the Eddington ratio 
itself, a further complication that we here neglect. 



Indeed, all the data seem consistent with the idea that, at every red- 
shift, there is a universal distribution function that describes the 
probability of a galaxy of any mass (above our completeness limit 
of ~ 10 10 Mq) to host an AGN of a giv en specific accre tion rate 
L x /M,, independent of host stellar mass l lAirdetalJl2012h . 

As we have anticipated, to the first order, the data are consis- 
tent with a power-law like distribution and a clear break at values 
of L x /M, consistent with the Eddington limit. In Table|2]we report 
the values of the best fit power-law distribution for each of the three 
redshift bins, in the form 10 !: 



p AGN (Lx/M,\M„z) = K^ 



(10) 



As compared to the values found in lAird et alj l l2012h , we find 
a somewhat steeper slope for the power-law index (« 1 rather than 
~0.7). We spec ulate that this mig ht have to do with the fact that, 
differently from lAird et al.1 J2012h . we do not include AGN from 
the deepest Chandra fields, that probe the fainter, shallower end of 
the X-ray luminosity function. Hint of such a flattening can be seen 
in the decline of the Pagn for the most massive galaxies at low X- 
ray luminosities in Fig [14] These are very low specific accretion 
rate objects for which we are not complete in the COSMOS sample 
(they are shown as lower limits in Figures 1 1 5 |and|T6l> and therefore 
cannot properly assess the robustness of any change in slope at low 

10 We note here that the fact that the obsened distribution of specific ac- 
cretion rates (or, for what matters, Eddington ratios) is typically found to be 
log-normal (see e. g. Kollmeier et al. 2006; Netzer 2009; Trakhtenb rot et al] 
|2011| ; ILusso et al.ll2012h ." rather than a power-law, is due to the combined 
effects of a flux limited survey, that depletes the number of low specific ac- 
cretion rates at progressively lower masses, and of the fact that very massive 
galaxies, where slowly accreting objects can be detected are exponentially 
rarer as the stellar mass increases. A thorough discussion of these selection 
effect on the de termination of the acretion r ate d istribution functio n of AGN 
can be found in lSchulze & Wisotzkil i2010|) and lAird et at] i2012l) . 
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Figure 16. The probability of a galaxy to host an AGN of a given specific accretion rate (per decade Lx/M, ratio) is plotted as a function of Lx/M. for three 
redshift intervals. Colored points, have the same coding of Fig. fTSI and plot the individual measurements, while black thick triangles are the mean values. The 
thick black line in each panel is a power-law fit to all the mean points obeying Log(Lx/M») < 34. For ease of comparison, in each panel we also show the best 
fit to the central redshift interval. The vertical dotted (dot-dashed) lines mark the approximate location of objects at the Eddington limit (or ten per cent of it) 
calculated assuming a constant bolometric correction &t,„i = 25 and a constant host stellar to black hole mass ratio of J?( = 500. 



Table 2. The best fit slope and normalization for the universal probability 
distribution function ppj^niLx/M,, z.) as a function of redshift for the entire 
X-ray selected sample, as shown in Fig. 1161 





all X-ray 


redshift range 


Log K y 


0.3 < z < 0.8 


29.50 0.96 


0.8 < z. < 1.5 


33.41 1.06 


1.5 < z, < 2.5 


33.46 1.04 



Lx/M,. The possibility that the Pagn(Lx/M,\M,,z) might break to 
a shallower slope at low values of the specific accretion rate could 
be tested, indirectly, by comparing the observed X-ray luminosity 
function of the AGN with a convolution of the observed galaxy 
mass function and the probability distribution p AGN (Lx/M,\M,,z)- 
We defer this investigation to a future work. 

Nevertheless, we see very clearly in Fig. [T6] that the 
Pagh(Lx/M,\M„z) normalization increases with redshift over the 
entire redshift range probed by our data. If we compute the value 
of the p A gn(Lx/M,\M,,z) at Log(L x /M,)=33.2, i.e. at a specific 
accretion rate approximately corresponding to « 10% of the Ed- 
dington rate, and where our data best constrain the distribution at 
all redshifts, we find that it evolves with redshift as fast as the spe- 
cific star formation rate of the overall population of galaxies (both 
star-forming and quiescent, see iKarim et allioTTh oc (1 + z) 4 . 

This very importa nt fact, already po inted out, over a more lim- 
ited redshift range, bv lAirdetal] fe012h . implies that the rapidity 
with which a black hole grows is not influenced by the host galaxy 
mass, while, on the other hand, is a strongly evolving function of 
redshift, with a redshift dependence that follows the overall evolu- 
tion of the specific star formation rate of the galaxy population. This 
also means that the trend with the hosts' mass visible in Fig.ll4land 



widely confirmed by many authors before us, is a trend induced by 
the fact that in more massive galaxies we are able to detect AGN 
down to a lower intrinsic Eddington ratio, while in less massive 
galaxies only rapidly accreting black holes are bright enough to be 
observable, and their total number is therefore much smaller. 



4.4 The incidence of AGN as a function of the star-forming 
properties of their hosts 

Studi es in the local Universe dBrinchmann et al]|2004klSalim et alj 
1 20051) have pointed out the existence of two main galaxy popula- 
tions: star-forming galaxies, which are currently forming stars at 
a rate that was found to be proportional to their mass, and quies- 
cent galaxies, which are massive galaxies in which the star forma- 
tion has been quenched*"*! The specific star formation rate (sSFR) 
can be defined as the ratio, sSFR=SFR/M„ of (total) star formation 
rate to stellar mass of a galaxy. The inverse of the sSFR defines a 
timescale for the formation of the stellar population of a galaxy, f SF 
= sSFR~ 1 , which gives the time it would take a galaxy to double its 
stellar mass if forming stars at the observed rate. 

In the sSFR vs mass plane, star-forming galaxi es can be di- 
vided into a star-forming sequence ("main sequence" iNoeske et alj 
1 20071) characterized by an almost constant (or slightly decreasing) 
sSFR as a function of stellar mass, and a high mass region with 
progressively decreasing sSFR, the "dying sequence", occupied by 
quiescent galaxies. In evolutionary terms, such a distribution is ob- 
served, almost unchanged, at any redshift, with the star-forming 



"Quenching", regardless of its physical cause, is the process that cause 
the cessation of star formation in some star-forming galaxies, which 
leads to the emerge nce of the so-called red-sequence of passive galaxies 
IStrateva et al.l200ll) . 
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Figure 17. Specific star-formation rate versus stellar masses in three redshift bins for the host galaxies of type-2 AGN compared to normal galaxies from 
the i-band-apparent-magnitude-matche d sample (gray points). T he solid lines are the correlations found for blue star-f orming galaxies: t he purple one in the 
lowest redshift bin correspond s to z~0 iBrinchmann et all2004l) . the black one in the intermediate redshift bin to z~l |Elbaz et al. 2007|) and the blue one in 
the highest redshift bin to z~2 (Daddi et alj2007|). In each panel the z ~ 1 curve is reported in grey for reference. The cyan dotted horizontal lines correspond 
to the age of the Universe ?#(z) at the minimum and maximum redshift considered in each panel and indicate roughly the division between galaxies which are 
actively forming stars and quiescent. 



galaxies sequence maintaining the same slope but i ncreasing nor- 
malization as one mov e s towards higher redshift dNoeske et alj 
120071 : lElbazetalj|2007l ; iDaddi et al.l [20071 ; IPannella et alj|2009h . 
There are also more extreme objects, i.e. Starburst galaxies and Ul- 
tra Luminous Infrared Galaxi es (ULIRGs), which exh ibit highly 
elevated star formation rates dSanders & Mirabellll9960 in galax- 
ies within the same range of stellar mass of normal galaxies. Lo- 
cal UL IRGs are believed to be associated with rare m ajor merger 
events JSanders et aflll988t ISanders & Mirabellll996j) and conse- 
quently distinct star formation processes, but recent studies with 
the Herschel telescope demonstrated that they represent a minor 
contribution to the total sta r formation rate density in the crucial 
redshift range 1.4 < z < 2.5 l lRodighiero et alj201 lh . Recent works 
dWuvts etalj201lh have shown that a detailed morphological anal- 
ysis of galaxies as a function of their position in the sSFR-stellar 
mass plane can provide a unique insight on the physical processes 
that shape and grow galaxies, and how the various modes of star 
formation can be associated to them. 

In light of the above results, it is obvious that a systematic 
study of the distribution of AGN host galaxies in the sSFR-stellar 
mass plane can reveal key pieces of evidence regarding the long 
sought-after association between SMBH growth and galaxy for- 
mation. Figure [17] shows the position of our obscured (type-2) 
COSMOS AGN host galaxies in the sSFR vs Mass plane com- 
pared to the overall non-AGN galaxy population (gray dots). We 
have purposely avoided to include type-1 objects in this diagram, 
as optical/UV-based SFR measures are too strongly affected by the 
unobscured accretion disc light in this class of AGN. 

The purple, black and blu e solid lines show the po sition of 
the main sequenc e at z ~ fBrinchma nn et al] 120041) , z ~ 1 
jElbaz et al.ll2007l) and z ~ 2 l lDaddi et alj|2007l) . while the cyan 
dotted lines correspond to the age of the Universe tn(z) at the min- 



imum and maximum redshift considered in each panel. Galaxies 
of known redshift and sSFR can in fact be divided into "quies- 
cent" and "star-forming" on the basis of their stellar mass doubling 
timescale, fsF^ if ?sf < tn(z), where tn(z) is the age of the Universe 
at the redshift of the source, the galaxy is defined as star-forming; if 
tsF > fpi(z) the galaxy is defined as quiescent. The i-band-apparent- 
magnit ude matched paren t galaxy population from the S-COSMOS 
survey fabert et alj2O10h does indeed show a broad distinction be- 
tween a star forming sequence and a dying sequence at the highest 
masses, which clearly evolves upwards (in sSFR) with increasing 
redshift. 

Obscured AGN hosts, to a first order, appear to populate the 
high-mass end of the distribution quite uniformly. We estimate that 
the fraction of AGN hosted in starburst galaxies (defined as in 
iRodighiero et alj[201 ll to be above 4 times the main sequence) is 
below 10% while the fraction of AGN hosted in main sequence 
galaxies range from 27% to 37% in the different redshift bins. The 
rest of the AGN (58%-66%) are found in quiescent galaxies. Con- 
siderin g only the most luminous sources as done by Mainieri et alj 
( l201ll) . we found that at z~l about 62% of Type-2 QSOs were 
actively forming stars (i.e. hosted in starb u rst an d main sequence 
galaxies). As comparison, iMullanevetal] fe012h . using the FIR- 
Herschel data in the GOODS field and excluding the galaxies in 
which a strong contribution from the AGN to FIR wavelengths was 
evident, estimated for moderate luminous AGN the following per- 
centages: < 10 % of AGN are hosted in strongly starbursting galax- 
ies; 79% in main-sequence galaxies, and 15% in quiescent galaxies. 
The discrepancy between the two findings could be attributed to the 
differences (especially at low SFR) found between the SFR derived 
from SED fitting and the FIR ones as showed in Appendix [B] To 
test this in detail, we fit below and above 20 M Q /yr~' the SFR SED - 
SFRir relation found (see left panel of Fig. IB lb and we assign to 
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Figure 18. Specific star formation rate distributions divided in three redshift bins and three mass ranges for type-2 AGN host galaxies (red histogram) and 
normal galaxies (gray histogram). Dot-dashed vertical blue lines are used to distinguish quiescent from star-forming galaxies based on the ratio between the 
mass doubling time fsF and the age of the Universe in any redshift bin. The upper panel of each histogram shows the observed AGN fractions. 



each of our SFR values, the corresponding SFR, R . We now find that 
the fraction of AGN hosted in strongly starbursting galaxies is 9%, 
while 67% are hosted in main-sequence galaxies, and 24% in qui- 
escent galaxies, in much better agreement with the Herschel-based 
results. 

Figure Q~8] shows, as histograms, the sSFR distributions of 
AGN hosts and normal galaxies in bins of stellar mass and red- 
shift. Once again, the AGN do appear to follow the evolution of 
the specific star formation rate with stellar mass and redshift, with 
just a marginal increase in the AGN fraction towards lower values 
of sSFR in almost all bins, indicating a slightly higher incidence of 
(obscured) AGN in non-star-forming galaxies. 

According to the models that assu me that major merge rs are 
the main AGN fueling mechanism (e.g. iHopkins et alj|2008l) . one 
could have expected an enhancement of the SFR activity in ob- 



scured AGN, which is however not visible. From our analysis, there 
is no clear evidence that the COSMOS obscured AGN hosts are 
preferentially found among the extreme, most rapidly star-forming 
galaxies. 

To analyze this in further detail we divide the host galaxies 
in two classes depending on the ratio between fsF and fn. Accord- 
ing to this division of the total sample of obscured AGN discussed 
here, the fraction of quiescent hosts (always computed for stellar 
masses above the c ompleteness limit of the COSMOS-IRAC sur- 
vey, as discussed in lllbert et alj[20loh are 75%, 65% and 61% in 
the redshift bins 0.3-0.8, 0.8-1.5 and 1.5-2.5, respectively. In to- 
tal, about two thirds of the obscured AGN in COSMOS that we find 
in massive galaxies are hosted by objects with stellar mass doubling 
time longer than the age of the Universe. These fractions are higher 
than those of the galaxy sample, where the fraction of quiescent 
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Figure 19. The average probability of a galaxy to host an AGN of a given specific accretion rate (per decade Lx/M, ratio) is plotted as a function of Lx/M, 
for three redshift intervals. In each panel blue filled circles are for star-forming galaxies (i.e. those with ?sf < 'h) and red squares are for quiescent ones 
(those with ?sf > tu). Solid lines of corresponding colors show the power-law fit to all points obeying Log(Lx/M.) < 34. The grey symbols and lines are 
the probability distribution functions for the overall galaxy population (for all AGN types), as from Fig. 1161 The vertical dotted (dot-dashed) lines mark the 
approximate location of objects at the Eddington limit (or ten per cent of it) calculated assuming a constant bolometric correction k\, i = 25 and a constant host 
stellar to black hole mass ratio of = 500. 



hosts are 51%, 41% and 32% in the redshift bins 0.3-0.8, 0.8-1.5 
and 1.5-2.5, respectively. Such a predominance of quiescent (or, 
better, not strongly star forming hosts) is indeed consistent with 
the predominantly red colors exhibited by the AGN hosts that we 
discussed in section PTTI 

In section l4.3.2l we have computed the probability of a galaxy 
(above the stellar mass completeness threshold of the COSMOS- 
IRAC survey) to host an AGN of a given specific accretion rate, 
irrespective of the star-formation rate properties of the galaxies. 
For the sub-sample of AGN which are obscured, for which a re- 
liable and robust determination of sSFR is possible, we have also 
computed separately the probability of star-forming and quiescent 
galaxies to host a (type-2) AGN as a function of specific accre- 
tion rate Lx/M,. The results are shown in figure [19] where we 
also report, for the sake of comparison the overall p A GN 01 Fig.l 161 
Taken at face value, the plot shows that there is little significant 
evidence for a difference in the probability of hosting a type-2 
AGN in quiescent and star-forming galaxies, as the data points 
agree within the errors (that here are representative of the disper- 
sion of p A Gx(Lx/M,\M,, z) among the different stellar mass bins). 
The slight increase in the probability of hosting AGN in passive 
galaxies is hinted at in the highest redshift bin, while in the low- 
est one, star-forming galaxies appear to have a higher probability 
of hosting AGN with low specific accretion rate, but the number 
statistics is far too low to make any firmer statement. Even the size 
of our COSMOS sample is not large enough, once we subdivide the 
sources into spectral class, X-ray luminosity, stellar mass and star 
formation properties of the hosts. 



5 DISCUSSION 

Consistently with earlier studies, X-ray selected AGN are found to 
reside mainly in massive galaxies, while optically selected Seyfert- 
2 galaxies, which are also the low luminosity ones, seem to prefer 



galaxies with smaller masses. The total stellar masses of the AGN 
host galaxies in our sample range from 10 10 Mo to 10 1L5 Mo with a 
peak at 10 105 Mq. No difference is found between X-ray obscured 
and unobscured AGN which on average show the same mass dis- 
tribution. 

In accordance with their high stellar masses, AGN hosts are 
more likely to be found in galaxies with red colors. Many au- 
thors have pointed out that the trend with color was just a mass 
driven effect and that removing the mass dependency would make 
it disappear ^Silverman etai]l2009l ;l Xue et al 1 l2010l : (Mainieri et alj 



201 H iR osario et al. 201JJ). Indeed, several previous studies (e.g. 
Silverman et alj 20081) claimed that AGN were found mainly in a 



region of intermediate colors, the so called 'green valley'. This was 
interpreted as evidence of a close association between the migra- 
tion of galaxies from the 'blue cloud' to the 'red sequence' and the 
feedback from active nuclei. However, in most of these works, the 
AGN component was not subtracted in the computation of the host 
colors, and the dust extinction correction could not be properly as- 
sessed, leading to a possible blue contamination from the AGN and 
a higher incidence of AGN in the blue cloud and green valley. These 
two effects could explain our findings. In fact, in disagreement with 
the above studies, we find that compared to normal galaxies of the 
same mass, galaxies hosting an AGN are on average (slightly) red- 
der and form stars less rapidly. It is only by properly taking into 
account both the dust extinction properties and the contamination 
from the nuclear light of the entire galaxy population that we can 
assess the true incidence of AGN as a function of star formation in 
their hosts. However, we cannot exclude that there is a difference 
in color between low luminosity AGN and intermediate-high lumi- 
nosity ones and that this differences are, at least partially, respon- 
sible for the discrepancy between our result and previous studi es 
based on deep fields (e.g. lSilverman et alj20 08: Aird et alj|2012r) . 

In agreement with the standard unified model dAntonuccl 

1 19931) and contrarily to what (maybe naively) expected from the 
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most popular models of AGN fueling and feedback dHopkins et alj 
l2008h . and with a number of caveats regarding the robustness of 
color determination in type-1 AGN, we do not find any signif- 
icant difference between the host colors of o bscured and unob- 
scured AGN. We note here that recent works bv lPierce et ailfeoiol) 
and iRosario et alj d201 ill found possible evidence of a weak anti- 
correlation between the color of the AGN hosts and the hydrogen 
column density of X-ray obscuring gas A'h, in the sense that red- 
der AGN seem to have on average higher values of A'h- As our 
classification into obscured and un-obscured AGN is a complex 
one, and mainly based on optical spectroscop y, it is not s t raight- 
for ward to compare our results with those of IPierce et alj J2010h 
and lRosario et alj l l201lh . but a common wisdom is emerging that 
(at least mildly) obscured AGN are not to be found among the most 
active star-forming galaxies. 

When looking in detail at the relationships bet ween AGN 
lumino sity and host stellar masses, in agreement with I Aird et alj 
d2012l) , we find that the probability to host an AGN of a given spe- 
cific accretion rate Lx/M, (a proxy for the SMBH Eddington ratio) 
is practically independent of the galaxy stellar mass, but depends 
strongly on redshift and on the specific accretion rate itself, i.e. 
there is a strongly decreasing probability to host an AGN with in- 
creasing Lx/M, ratio. 

This also means that the higher AGN fraction is observed 
in massive red galaxies simply because it is easier to detect the 
more prevalent, low Eddington ratio AGN in more massive galax- 
ies, which are more likely to host the more massive SMBH. 

The specific accretion rate distributions we can indirectly in- 
fer from the observed probability of a galaxy to host a AGN is 
better described by a power-law function, the slope and normaliza- 
tion of which appear to increa se with redshift. We note here that 
iKauffmann & HeckmarJ <2009l) . studying the present-day growth 
rates of black holes in SDSS galaxies using the [OIII] luminosity 
as a tracer of AGN accretion activity and stellar masses as tracer 
of the BH masses, pointed out the existence of two trends in the 
Eddington ratio distribution as a function of the stellar population 
(a 'power-law' and a 'log normal'). These observed trends have 
been interpreted by them as the indication of two distinct accre- 
tion "modes" for black holes: the log-normal distribution identifies 
a mode in which the supply of gas is ample ("feast"), while the 
power-law distribution corresponds to a mode where accretion is 
limited by the available suppl y of gas ("famine"). Another inter- 
pretation has been proposed bv lHopkins et alj ( 120081) , who identify 
the origin of the bi-modality in the evolution of triggering rates 
(mergers) in combination with a luminosity-dependent AGN life- 
time. Our result shows that at higher redshift the picture might 
be different, as we do not find clear sign of a log-normal compo- 
nent in the specific accretion rate distribution, even when divid- 
ing the population of type-2 AGN hosts into actively star-forming 
and quiescent. This difference could be indicative of an interest- 
ing evolutionary effect to be further investigated, although we can- 
not e xclude possible selection effect due to the fact that the work 
from IKauffmann & HeckmarJ J2009D is based on a pure optically 
selected sample for which the AGN luminosity is computed based 
on the highly uncertain narrow line [OIII]-to-bolometric luminosity 
conversion. Nonetheless, the apparent universality of the observed 
probability distribution Pagn(Lx/M,\z) suggests that, at least in the 
redshift range populated by our COSMOS sample, the same physi- 
cal processes might be responsible for triggering and fueling AGN 
activity in galaxies of different masses. 

We also find that the overall probability to host an AGN of 
any given Lx/M, (as determined by the distribution normalization) 



increases rapidly with redshift as (1+z) 4 . Such an evolution can be 
due to either an increase in the number of sources accreting (ver- 
tical shift) or a shift of the characteristic Eddington ratio distribu- 
tion towards higher values (horizontal shift). The second possibility 
would also explain the "AGN downsizing" (the comoving number 
density of high lumin osity objects peaks at higher redshifts than 
low-luminosity AGN. lLa Franca et alj|2005t lHasinger et"al]|2005l : 
iBongiorno et alj|2007l) as a consequence of the different Eddington 
ratio distribution at different redshift. 

The fact that the probability of a galaxy to host an AGN of a 
given specific accretion rate evolves with redshift in the same way 
as the overall evol ution of the specifi c star formation rate of the 
galaxy population jKarim et alj|201 lh strongly suggests that AGN 
activity and star formation are globally correlated, at least in a sta- 
tistical sense. From the physical point of view, the drop with time 
in the global star formation rate density reflects a drop in the avail- 
ability of cold gas in galaxies to fuel star formation and potentially 
AGN. In that respect, AGN seem to follow the evolution of the 
galaxy population as a whole, a possibility reinforced by our de- 
tailed study of the position of AGN hosts in the sSFR-stellar mass 
diagram (section|4j4}. 



6 SUMMARY AND CONCLUSION 

In this paper we studied a sample of ~ 1700 obscured and unob- 
scured AGN, selected in the COSMOS field using X-ray and op- 
tical (spectroscopic) criteria. Of these, 602 are unobscured type-1 
AGN (430 with spectra) and 1 100 obscured type-2 AGN (549 with 
spectra) with Lboi ranging from 10 43 to 10 47 erg s -1 . Thanks to the 
wide COSMOS multi-wavelength coverage we were able to study 
in detail the source SEDs and to fit the observed fluxes with a two 
component model based on a combination of AGN and host-galaxy 
templates, accounting for dust extinction in both components inde- 
pendently. This method allowed us to decompose the entire spectral 
energy distribution into a nuclear AGN and a host galaxy compo- 
nent for almost all the sources and to derive robust measurements of 
both the AGN and the host galaxy properties (i.e. rest-frame colors, 
masses and SFRs). A test considering 1-component (galaxy) versus 
2-components (galaxy+AGN) SED fitting technique, discussed in 
Appendix lAl revealed that while considering only the galaxy com- 
ponent can be used to roughly estimate the blue colors of obscured 
AGN hosts, caution has to be used when deriving red magnitudes 
(e.g. Mjf) and physical parameters, e.g. stellar mass and SFRs. On 
the contrary, for unobscured AGN hosts no study should be per- 
formed without properly taking into account the AGN component; 
even with this caution, we find that for unobscured AGN while total 
Stella masses of their host can be determined quite reliably, start- 
formation rate are too affected by nuclear light contamination to be 
deemed robust. 

Our main findings can be summarized as follows: 

• COSMOS AGN host galaxy masses range from 10 10 M G to 
10 IL5 Mq with a peak at « 10 109 Mq. No significant difference is 
found in the stellar mass distributions of X-ray obscured and unob- 
scured AGN hosts. Only optically selected Seyfert-2 galaxies with- 
out X-ray counterpart populate the tail in the distribution at lower 
masses down to 10 85 Mq. 

• Host galaxies of obscured AGN span a very wide range of 
SFRs, with X-ray selected AGN populating the entire range, while 
optically selected Seyfert-2 galaxies being hosted mainly in galax- 
ies with low SFRs (and very low masses). As extensively discussed 
in Appendix IB] we find that above ~ 20Mq yr~' the SFRs derived 
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from the SED and the one derived from the Herschel FIR observa- 
tions are in good agreement. At low SFR (< 20Mq yr _I ), however, 
the disagreement becomes evident, with our SED fitting method 
indicating substantially lower SFR than the FIR one. 

We interpret this discrepancy as the possible combination of two 
effects: (i) contamination from star-formation-related emission in 
the NIR not properly taken into account in the SED fitting and (ii) 
AGN contamination in the FIR bands not taken into account in the 
FIR SFR estimates. We demonstrated that at least for the XMM- 
COSMOS sample, which is dominated by high luminosity AGN, 
the driven effect is the AGN contamination on the 60//m luminosi- 
ties (see Appendix iBlfor e detailed description), which, on the con- 
trary, is low or even negligible for moderate and low luminosity 
AGN samples. 

• In contrast to what was found bv lCardamone et all J201oh , no 
clear bi-modality is seen in the color magnitude diagram of the 
COSMOS AGN hosts. At any redshift, the hosts of the AGN in 
our sample are mainly massive, redder galaxies. In many previous 
studies on AGN samples of much smaller size, AGN were found 
to have a higher incidence in star form ing galaxies of blue colors 
dSilverman eta71l2009l : IXue et alj[20loh . once mass matched sam- 
ples of active and non-active galaxies are considered. Even when 
accounting properly for the mass-color degeneracy, and when look- 
ing at mass-matched sample, we do not confirm these findings with 
our much improved statistics. Even at fixed stellar masses, well 
above the completeness limit of the COSMOS sample, we find 
a slight trend for AGN hosts to be redder (by ~ 0.1-0.2 magni- 
tudes) than the non active galaxies. We argue that the main rea- 
son for such a discrepancy is the proper subtraction of the AGN 
light from our SED decomposition technique, complemented by a 
uniform and systematic treatment of dust extinction in both parent 
and AGN host galaxy sample. We caution, however, that degenera- 
cies in color-magnitude diagrams may easily arise due to alternative 
choices of spectral templates and residual issues with the AGN-host 
decomposition technique. 

• For a galaxy of any given mass above the stellar mass com- 
pleteness limit of the COSMOS survey, the probability to host an 
AGN of a given X-ray luminosity increases with stellar mass, and 
this holds true for any value of the X-ray luminosity we are sen- 
sitive to. In fact, against the naive expectation that more luminous 
AGN should be found in more massive galaxies and less luminous 
ones in lower mass galaxies, we find that AGN of any intrinsic lu- 
minosity are more common the more massive the galaxy is. 

• We have used the observable "specific accretion rate", i.e. the 
ratio between X-ray luminosity and host galaxy stellar mass as a 
proxy for the Eddington ratio of the growing black hole. By do- 
ing so we find that the probability for a galaxy to host an AGN of 
any given Eddington ratio (or specific accretion rate) is roughly in- 
dependent of the host galaxy stellar mass, but strongly decreases 
with increasing Lx/M„ and we also found a clear evidence of a 
break in the specific accretion rate distribution at values consis- 
tent with the Eddington limit. These results imply that the higher 
incidence of AGN observed in massive galaxies by essentially all 
multi-wavelength surveys is just a consequence of the fact that low 
specific accretion rate objects are more common than high specific 
accretion rate ones, and, at the same time, low Lx/M, objects in low 
mass galaxies drop out of flux-limited AGN samples. This effect, 
closely associated to the broad Eddington ratio distribution of AGN 
was anticipated in Merloni & Heinz (2008) and has been confirmed 
observationally by |Airdetal] < l2012l) . 

• We found that the probability for a galaxy to host an AGN 
of any given specific accretion rate strongly increases with red- 



shift, approximately as (1+z) 4 for SMBH accreting at about 10% 
of the Eddington limit, where most of the SMBH mass assembly is 
supposed to take place. Such a very strong evolutionary trend fol- 
lows closely the overall ev olution of the speci fic star formation rate 
of the galaxy population jKarimetalJl201ll) . This is yet another 
strong indication that AGN activity and star formation are globally 
correlated. 

• To study in detail the star formation properties of the COS- 
MOS AGN hosts, we have looked at the type-2 (obscured) AGN 
only, in order to avoid any bias/contamination from the inaccu- 
rate subtraction of the dominant blue nuclear light in type-1 ob- 
jects. Type-2 AGN hosts have on average the same, or slightly 
lower SFRs than non active galaxies of the same mass and at the 
same redshift. We find that the fraction of AGN hosted in strongly 
starbursting galaxies is very low (<10%) while is higher for main 
sequence galaxies (27% to 37%) and q uiescent ones (58%-6 6%). 
Compared to Herschel-bussd results (e.g. iMullanev et al.l2012h . wc 
find a much higher fraction of quiescent host. We tested that the dis- 
crepancy between the two results is explained by the differences in 
the SFR (especially below 20 M Q yr ) derived from SED fitting 
and FIR which have been attributed to the combination of the two 
opposite effects explained above. 

The probabilities of star-forming or quiescent galaxies to host a 
type-2 AGN also decline as a power-law as a function of specific 
accretion rate and there are tantalizing hints of difference in the 
two populations and in their redshift dependence. However, more 
detailed investigations of the incidence of AGN in galaxies as a 
function of redshift, stellar mass, star-formation rate and nuclear 
luminosity are hampered by the limited statistics of the sample. 
Wide area surveys probing volumes at z > 1 comparable to that 
explored by SDSS in the local Universe will be needed to properly 
address the fundamental questions on the physical nature of the 
AGN-galaxy co-evolution. 

The most striking piece of evidence that emerges from our 
study is the fact that AGN hosts do follow quite closely the overall 
evolution of the stellar build-up of the parent galaxy population 
and populate both star-forming and more passive galaxies almost 
equally. 

We argue that, on the basis of the evidence presented in this 
work, it is hard to find strong indication that AGN play a direct role 
in shaping the global properties of their host galaxies, or their evo- 
lution. The same, probably stochastic, process of AGN activation 
and triggering seems to be in place in galaxies of all masses, with 
a slight predominance in red ones with specific star formation rates 
below the nominal threshold that distinguish actively star-forming 
from quiescent galaxies. Because of the robustness of this picture 
across the wide redshift range probed, we believe that whatever 
physical process is responsible for triggering and fueling AGN ac- 
tivity, it must be the same between z ~2.5 and z ~0.3, but must de- 
crease in frequency or shift towards lower accretion rates. Although 
AGN activity and star formation appear to have a common trigger- 
ing mechanism, we do not find strong evidence of any 'smoking 
gun' signaling powerful AGN influence on star forming galaxies 
SFR. The extent to which this lack of evidence can be meaning- 
fully used to constrain and/or rule out theoretical models of AGN 
feedback, remains to be investigated. 
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APPENDIX A: 2-COMPONENTS VERSUS 
1-COMPONENT SED FITTING 

In this section we will discuss the differences between the measured 
galaxy parameters that one obtains when using the SED fitting with 
two components (AGN and galaxy) compared to what would be 
obtained ignoring the AGN component, as in other studies (e.g. 
ISilverman etaLll2009l : iRosario et alj|201 ]]). The color code for the 
symbols in the figures is the same as used throughout paper. 

Al Rest-frame magnitudes and galaxy colors: 

Rest frame magnitudes in any given band are computed integrat- 
ing the rest-frame galaxy template found as best fit solution in the 
corresponding filter. They are thus related to the "type" of galaxy 
chosen as best fit (which can introduce a difference up to 4 mag- 
nitudes) and, more important, to the normalization of the template, 
which is directly related to the total flux emitted by the source. In 
case of obscured AGN, the total emission in the U- and B-bands 
(/l e fj ~ 3800A - 4500A) is hardly influenced by the nuclear emis- 
sion which is, at these wavelengths, partially or entirely absorbed 
(see lower panels Fig. [3j. For this reason the shape of the chosen 
galaxy at these wavelengths and the normalization of the template 
should not strongly change when considering (or not) the AGN 
component. In order to test the effect of using one or two com- 
ponents fitting, we re-run our code using only galaxy templates. 
As shown in Fig. IA1I the U and B rest frame host galaxy mag- 
nitudes of obscured AGN with (e.g. U[SED agn+G al]) an d without 
(e.g. U[SED G al]) the AGN component lie on the one-to-one re- 
lation with a roughly symmetric scatter of about 0.4 magnitudes 
(red, yellow and green symbols). On the contrary, the correlation 
is much worse for unobscured AGN (blue circles) whose U- and 
B-band rest frame magnitudes with and without AGN can be up 
to more than 2 magnitudes different. Moreover, the spread is not 
symmetric and not using the AGN component always leads to a 
much brighter galaxy. This is due to the fact that unobscured AGN 
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Figure Al. Comparison between the rest frame U-, B- and K-band magnitudes of our AGN host sample obtained using the 2-component SED fitting presented 
in this work versus the one obtained using a standard SED fitting considering only the galaxy component. 
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Figure A2. Comparison between the SED fitting parameters t age /T obtained 
using the 2-component SED fitting [GAL+AGN] presented in this work 
versus the one obtained using a standard SED fitting considering only the 
galaxy component. 



emission is strong in the U and B bands (see upper panels of Fig. 
[3}, i.e. not taking into account the AGN component corresponds to 
additional attributed flux to the galaxy hence resulting in a overes- 
timation of the galaxy luminosity. The conclusion is different if we 
look at the comparison in the K-band (A^ ~ 2.2 x 10 4 A). Contrar- 
ily to the U- and B-band, at these wavelengths the contribution of 
AGN emission can be relevant even in the obscured sources (see 
e.g. the bottom right panel of Fig. [3}- For this reason, the K-band 
magnitude of both obscured and unobscured AGN hosts will be 
overestimated (i.e. up to 2 magnitudes brighter) if we do not take 
into account the AGN component. 

The objects whose host luminosities are less affected by the 
presence of an AGN component are the optically selected type- 
2 AGN without X-ray emission (triangles). For these objects the 
obscuration must be very high in order to suppress the AGN emis- 
sion up to the K-band wavelengths. As suggested is Sec. 13.61 these 
sources are likely Compton-thick AGN and hence a high level of 
obscuration is expected. 



A2 Host galaxy stellar mass and SFR 

Apart from the rest-frame luminosity in various bands, the other 
two important parameters we derive from the SED fitting are the 
stellar masses and the star formation rates. Besides the dependence 
on the template normalization, these physical parameters are linked 
also to the principal parameters used in the SED fitting, i.e. the e- 
folding time (t), the age since the beginning of star formation (t age ), 
and to some degree the extinction (E(B-V)). All SFHs considered 
in this work have an exponentially decreasing shape with 5 FR oc 
g-'age/r (buj on6j w hich is constant). For such an exponentially de- 
creasing SFH, a large value of t flgf /r implies a quasi passive evolu- 
tion with a negligible ongoing SF and tha t the majority of the stars 
in the galaxy are old dGrazian et alj|2007h . 

Fig. I A2I shows the difference in the t age /r ratio when consid- 
ering or not the AGN component in the SED fitting. In almost all 
cases, not considering the AGN component results in lower values 
of t a g e /T, which directly translates into galaxies with higher star 
formation rates, as visible in the right panel of figure |A3l The right 
panel of this figure shows, for obscured type-2 AGN in the COS- 
MOS sample, the ratio between the SFR obtained without the AGN 
component and once the AGN component is taken into account. 
For about 40% of the objects, the two values agree within a factor 
of 2, while for the rest of the objects the SFR estimated without 
AGN component is up to more than 20 times higher than the one 
obtained including the AGN. 

Finally, the measured stellar masses show a non-negligible 
spread in both directions, for both obscured and unobscured AGN. 
As showed in the left panel of Fig. I A3 1 the masses computed not 
including the AGN component range from 6 times higher (2 times 
if we consider only spectroscopic redshifts) to 10 times smaller (i.e. 
in unobscured AGN). 

Summarizing, for obscured AGN a standard galaxy SED fit- 
ting technique that does not include a nuclear component results in 
small differences in rest frame U and B absolute magnitudes (~0.4 
mag brighter) and larger ones for the K-band magnitudes (~0.8 mag 
brighter). The mass and SFR estimates are instead much more af- 
fected (up to a factor of 6 in mass, and SFR up to 20 times higher). 
This is not the case for optically selected type-2 AGN with no X- 
ray counterpart for which, due to the extremely high obscuration, 
the differences in magnitudes, mass and SFR are relatively small 
when considering (or not) the AGN component. For unobscured 
AGN on the contrary, all derived quantities (absolute magnitudes, 
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Figure A3. Comparison between the stellar masses and SFR of our AGN hosts obtained using the 2-component SED fitting presented in this work versus the 
one obtained using a standard SED fitting considering only the galaxy component. 



stellar mass and SFR) are strongly offset from the values obtained 
when taking into account the AGN component. 

In conclusion, while the SED fitting technique considering 
only the galaxy component can be used to roughly estimate the 
blue colors of obscured AGN hosts, caution has to be used when 
deriving red magnitudes (e.g. M K ) and physical parameters, e.g. 
stellar mass and SFRs. On the contrary, for unobscured AGN hosts 
no study can be performed without properly taking into account the 
AGN component. 



APPENDIX B: COMPARISON BETWEEN DIFFERENT 
SFR INDICATORS 

Evidences of star formation in galaxies can be seen in both the UV 
and IR bands. The UV emission is a direct measure of the starlight 
produced by a mixture of young (O, B and A type) stars. Using the 
UV emission to measure the SFR suffers from the limitation that 
part of the emitted radiation is absorbed, which is not taken into 
account. On the other hand, when using only the IR emission to 
constrain the SFR, the assumption is that a dominant fraction of the 
total bolometric stellar luminosity of the star-forming population 
is absorbed and re-radiated as thermal infrared dust emission. The 
most robust way to compute the SFR is thus the sum of both the 
unobscured UV star formation with the obscured one re-emitted in 
the IR by dust (SFR uv+ ir). 

Thanks to the advent of the Herschel Space Observatory, deep 
far-infrared observations of the COSMOS field are now available. 
Here, we use the deep COSMOS PACS 100 andJ60jmi obser- 
vations of the PACS Evolutionary Probe (PElf 1 ! ILuTz et alj|201 lh 
guaranteed time key program, reaching a 3<x li mit of ~ 4 mjy an d 
~ 10 mjy at 100 and 160 /jm, respectively (see lBerta et alJuoTlh . 
Using these observations we infer the obscured SFR of our host 
galaxies and test the accuracy of SFR derived from SED fitting. 

12 http://www.mpe.mpg.de/ir/Research/PEP 



Only a small fraction of our sources (~ 10%) are detected 
above the 3<x limits of the PEP catalogues. For these sources we 
computed the SFR using their in frared lum i nositie s and assum- 
ing the h IR to SFR conversion of iKennicut 3 dl998l) . i.e., SFR = 
10" 10 L« for a Chabrier IMF. The IR luminosity hm is derived 
from the monochromatic luminosity at the longest PACS wave- 
length availabl e conv erted using the infrared SED templates of 
ICharv & Elbazl fcOOll) as it has been prov en to provide reli able 
conversions over a broad range of redshift dElbaz et"al]|201 ll) . In 
the left panel of of Fig. IB II we show the comparison between the 
SFRs computed using the IR data and the one computed with the 
2-components SED fitting. As visible, the majority of the sources 
lie above the 1 to 1 relation with the SFR computed from the SED 
being lower than the one derived from the IR. This is driven by a 
selection effect due to the fact that, given a broad distribution of 
SFR//;/SFRs £0 values, the Herschel detected sources will be the 
most powerful ones corresponding by definition to the upper enve- 
lope of the SFRy/i/SFRi^o distribution. 

However, to test the accuracy of the SFR derived from SED 
fitting, we have to rely also on a statistical approach using a stack- 
ing analysis. Images of galaxies in a given redshift and SFR bin 
are stacked, allowing us to detect their typical far-infrared flux 
densities well below the detection limits of the current PACS ob- 
servations (the noise of the stacked images scales roughly with 
the square root of the number of the stacked sources). We start 
by separate our galaxies into the usual three redshift bins (i.e., 
0.3 < z < 0.8, 0.8 < z < 1.5 and 1.5 < z < 2.5). Within each 
redshift bin, we then separate our galaxies per SFR bins. The size 
of the SFR bins are optimized in order to obtain the highest number 
of bins with clear PACS detections (i.e., > 3<x). The first SFR bin 
is initialized with a size of 0.1 dex and an upper range equal to the 
highest SFR observed in the redshift bin. Then, we progressively 
decrease the value of the lower range of the bin (using steps of 0. 1 
dex), until we obtain a 3cr detection in at least one PACS passband. 
The lower range of the bin is then taken as the upper range of the 
next SFR bin and the same procedure is repeated in order to find the 
size of the new bin. The process is stopped once the lower range of 
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Figure Bl. Left: Comparison between the SFR derived from the SED fitting and the one derived from the FIR band for the sources detected in at least one FIR 
band. Open and filled squares corresponds to objects with photometric and spectroscopic redshift respectively. Right: Comparison between the SFR derived 
from the SED fitting and the one derived from FIR using stacked data (binning in SFR sc( j and redshift). Different colors correspond to different redshift ranges: 
0.3<z<0.8 black, 0.8<z<1.5 green and 1.5<z<2.5 blue. 



the current bin reaches the lowest SFR observed in the redshift bin. 
Photometry of the stacked images are me a sured using the PSF fit- 
ting analysis presented in iMagnelli et al.l d200°l 1201 ll) and using 
the relevant PSFs and aperture corrections of the COSMOS field 
jBerta et al]|201 lh . Errors are measured using a standard bootstrap 
analysis which takes into account both the photometric uncertain- 
ties of the stacked images and the intrinsic dispersion of the under- 
lying galaxy population. 

We derive the obscured SFR (i.e., SFRj R ) of each redshift 
and SFR bin using t heir stacked PACS flux densities and the 
ICharv & Elbazl feOQlh SED library. In the case where there is 
only one PACS detection (at 100 jim or 160 um), SFRir is ob- 
tained simply using the scaled Icharv & Elbazl 1 200 lh SED library. 
In the case where there are two PACS detections (at 100 jim and 
160^m), we fit the PACS flux densities with the Chary & Elbaz 
(2001) SED library leaving the normalization of each template as 
a fr ee parameter. In both c ases, SFR IR is given by integrating the 
best lCharv & Elbazl feOOlh SED template f rom 8-to-1000^m (Ltr) 
and using the s tandard SFR-Lir relation of lKennicuttl i ll 9981) for a 



1000 



IChabrierl Hool) IMF (SFR [M© yr 1 ] = 1 x 10- 10 Li R [L Q ]). 



In the right panel of Fig. IB 1 1 we compare for type-2 AGN 
the SFRs computed using the stacked IR data and the one com- 
puted with the 2-components SED fitting. Above ~20 Mq yr~' the 
comparison is good with the SFR derived from the SED fitting 
being slightly underestimated, while at low SFR (<20 Mo yr ) 
the disagreement becomes evident. This trend seems to be in con- 
trast with previous studies w hich, however , were base d on samples 
of nor mal galaxies. In fact, [Santini et al. | d2009h and IWuvts et alj 
d201 lh studying a sample of galaxies at high redshift (up to z~3) 
found an overall good correspondence between the SFR uv +ir and 
SFRsed up to intermediate SFR, while at the largest SFR (> 50Mq 
yr-1) the discr epancy was s i gnific ant and hence somewhat worri- 
some. However, IWuvts et alJd201lh showed that the deviation from 
a one-to-one relation between SFR uv+ ir and SFR S ed is most severe 
for galaxies that have a large SFRir/SFRuv ratio which is not the 
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Figure B2. Comparison between the SFR derived from the SED fitting 
without AGN component and the one derived from FIR using stacked data 
(binning in SFR sc( j and redshift). Different colors correspond to different 
redshift ranges: 0.3<z<0.8 black, 0.8<z<1.5 green and 1.5<z<2.5 blue. 



case for most of our sources (especially at high X-ray luminosities) 
for which the IR emission is dominated by the emission from hot 
dust heated by the AGN rather than young stars (see Sec. |3.6| and 
Fig.[8]l. However, for the sources for which this is not the case, we 
still assume that the IR band (~12yum at z ~1) is dominated by the 
AGN emission and this assumption can lead to underestimate the 
SFR attributing the IR emission due to obscured young stars to the 
AGN component. On the other hand, the IR estimation of the SFR 
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Figure B3. Mean total (AGN+SF) Lgo^m versus mean Lgo™, due only to 
the SF contribution obtained using a moc k catalog of galaxie s with differ- 
ent levels of SF and AGN contaminations ( Ro sario et al. 2012|), statistically 
weighed using the observed Lf,„/ distribution of our AGN sample in the 
usual three redshift bins. Error bars show the range in L6o,„„(SF) used in 
the computation of the mean. 

assume zero contamination from the AGN component in the IR 
and attribute the whole IR flux to the emission of the young stars. 
Again, this assumption, reasonable in most cases, can have the ef- 
fect of overestimating the SFR, and the effect would be higher at 
low SFR where the contamination from the AGN is a significant 
fraction of the IR emission. As explained below, we performed two 
kinds of tests to explore the above hypotheses. First of all, we com- 
pared the SFR obtained from the SED fitting without including the 
AGN component with the IR SFR (see Fig. |B3. We find that in 
this case there is a much better agreement between the two SFR 
indicators at low SFR. However, at high SFR we no longer have 
an agreement i.e. the SFR from the SED fitting is higher than the 
one derived from FIR band (Fig. lB2l l, The result of this compari- 
son indicates that the discrepancy seen in the right panel of Fig. lBTl 
can be partly due to the fact that the SED fitting method tends to 
overestimate the AGN component thus under-predicting the SFR. 

On the other hand, to test the AGN contamination to the FIR 
bands, we built a mock catalog of galaxies with different lev- 
els of SF and AGN contamination^ From this catalog and us- 
ing the computed weights, we calculated the average Lgo/mi due to 
AGN+SF as a function of the L 60 ^ m due only to the SF. We find that, 
if we include the lowest luminosity tail of the AGN distribution i.e. 
AGN with L/,„; down to 10 41 erg s -1 , there is hardly any difference 
between the ~L SF and L ror since the weighed mock is dominated 



by low luminosity AGN. On the contrary, if we restrict the com- 
parison to Lfc„; > 10 44 erg s , we find that the 60/j.m luminosity 
is contaminated by the AGN contribution especially at low L 60Alm - 
Since L«)/m is commonly used as a proxy for L w and, therefore, 
SFR, this plot can be directly compared with Fig. IB II right panel. 
As clearly visible, the two plots show a very good agreement in the 
overall shape. 

To perform a direct comparison with the observed trend in Fig. 
IB 1 I right panel, we used the same mock catalog as before which this 
time is weighed using the observed L/,„/ distribution of our sample 
in the usual three redshift bins. The result is shown in Fig. lB3l where 
the mean total (AGN+SF) L 60 ,,„> versus the mean h 60flm due only to 
the SF contribution are plotted. As clearly visible in the plot, the 
shape of the points in this figure and in our Fig. IB II right panel 
are in perfect agreement. This demonstrates that the trend observed 
when comparing SFR-IR and SFR-SED is mostly driven by the 
effect of AGN contamination on the 60/um luminosities. This effect 
is strong for the XMM-COSMOS sample, which is dominated by 
high luminosity AGN, but can be much smaller or even negligible 
when considering lower luminosity samples as demonstrated in the 
above test. 

We conclude that the discrepancy seen in the right panel of 
Fig. lB 1 l is due to the combination of two effects: (1) the SED fit ten- 
dency of overestimating the AGN component thus under-predicting 
the SFR and (2) the FIR-overestimation of the SFR especially at 
high AGN luminosities and low SFR, where the AGN contamina- 
tion in the IR band is not negligible as demonstrated by the tests 
performed using the mock catalog. 



APPENDIX C: AGN HOST GALAXY PARAMETERS 
DERIVED FROM THE 2-COMPONENT SED FITTING 

Here we report as example the first 16 lines of the tables published 
online. 

This paper has been typeset from a TpX/ ETpX file prepared by the 
author. 



intrinsic AGN luminosity at 60 /im was estimated using the mean SED 
(norm alized to the bolometric luminosity of the AGN) from lMullanev et al] 
bOl ll) and assuming a scatter around this value of 0.3 dex, which corre - 
sponds to the expected scatter in the SED shapes from lRosario et all j2012b . 
Since the simulation assumes that the AGN have a uniform distribution in 
AGN bolometric lum inosity, we used the bolometric luminosity function of 
iHopkins et al to derive statistical weights for the AGN sample de- 

pending on their L/, ; and we applied them to get the real AGN distribution. 
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Table CI. Sourc e identificationf the full version of the table is avai lable online). Columns are (1) XMM ID fro m jCappelluti et alj|2009h ; (2) zCOSMOS ID 
iLillv et alj|2009h in case the object is in the BPT selected sample iBongiorno et alj|2010l) ; (3) zCOSMOS ID jLillv et alj|2009l) in case the object is in the 
NeV selected sample (Mignoli et al. in prep); (4) photometric ID from lllbe rteTaiT feoOgl); (5) R A: (6) DEC; (7) I-band apparent magnitude; (8) spectroscopic 
redshift; (9) photometric redshift when the spectros copy is not availa ble iSalvato et al.ll2009h ; (10) Classification derived fro m the spectra: l=unobsc ured 
AGN, 2=obscured AGN classified from the X-ray jBrusa et al.ll2010l) . 22.2=obscured AGN selected using the BP T diagram {Bo ngiorno et al. 201o]) and 
22.4=obscured AGN selected through the [Ne v] line (Mignoli et al. in prep); (11) Classification derived from the SED jSalvato etall201 ll) in case the spectra 
are not available ( 1 =unobscured AGN, 2=obscured AGN). 



XMM-ID 


zC-ID(BPT) 


zC-ID(NeV) 


PHOT-ID 


O<J2000 


<5j2000 


Iab 


Zspec 


Zphot 


Class S p ec 


ClasSph t 


1 






786683 


150.10521 


1.981182 


19.12 


0.373 




1.0 


-99.0 


6 






767213 


150.17979 


2.110378 


18.34 


0.360 




1.0 


-99.0 


19 






1024197 


149.99362 


2.258538 


20.12 


0.659 




2.0 


-99.0 


26 






1627881 


150.39963 


2.687948 


18.25 


0.216 




2.0 


-99.0 


21 






1425464 


150.23079 


2.578183 


20.12 


1.403 




1.0 


-99.0 


63 


826475 




1067126 


149.78189 


2.139104 


18.97 


0.355 




2.0 


-99.0 


181 






751114 


150.26744 


2.055690 


24.93 




1.719 


-99.0 


2.0 


182 






1215214 


150.32085 


2.332960 


25.48 




2.030 


-99.0 


2.0 


184 






1453559 


150.22397 


2.550771 


22.26 




1.111 


-99.0 


2.0 


185 






808830 


149.99526 


2.006661 


21.90 




2.189 


-99.0 


1.0 




803488 




27241 1 


150.65016 


1.657098 


20.43 


0.587 




22.2 


-99.0 






803886 


286168 


150.52974 


1.725584 


20.93 


0.896 




22.4 


-99.0 






804237 


291946 


150.42996 


1.684480 


22.30 


1.000 




22.4 


-99.0 




804862 




303288 


150.27359 


1.778924 


22.44 


0.370 




22.2 


-99.0 






804431 


308347 


150.38304 


1.745181 


22.06 


0.702 




22.4 


-99.0 




805059 




311801 


150.23288 


1.724173 


22.36 


0.338 




22.2 


-99.0 



Table C2. Parameters derived from the SED fitting(the full version of the table is available online). Columns are: (1), (2), (3) as above; (4) extinction 
E(B-V) of the AGN component (5) rest-frame U-band magnitude of the host galaxy not corrected for extinction; (6) rest-frame B-band magnitude of the host 
galaxy not corrected for extinction; (7) host galaxy stellar mass; (8) flag indicating whether the previous value is a measurement (=0), an upper limit (=1) or if 
there is no measurement (=2); (9) host galaxy star formation rate; (10) flag indicating whether the previous value is a measurement (=0), an upper limit (=1) 
or if there is no measurement (=2 or =3); (11) extinction E(B-V) of the galaxy component. 



XMM 


zC(BPT) 


zC(NeV) 


^B-V 


M u' 

Ujtc 




log(M? al [M ]) 


f Mf" 


logfSFR^'pVIoyr 1 ]) 


f SFR gal 


C B-V 


1 






0.0 


-20.235 


-20.641 


9.238 





-99.0 


3 


0.3 


6 






0.1 


-21.655 


-22.406 


10.975 





-99.0 


3 


0.0 


19 






0.3 


-21.352 


-22.254 


11.158 





1.459 





0.2 


26 






5.6 


-20.108 


-20.983 


10.546 





0.364 





0.0 


21 






0.1 


-99.000 


-99.000 


10.537 


1 


-99.0 


3 


0.0 


63 


826475 




8.7 


-20.647 


-21.582 


10.994 





0.861 





0.1 


181 






0.6 


-20.417 


-21.728 


11.051 





1.628 





0.5 


182 






0.6 


-20.746 


-22.006 


10.984 





-0.152 





0.1 


184 






9.0 


-21.045 


-21.952 


10.866 





1.005 





0.1 


185 






0.0 


-23.113 


-24.295 


11.296 





-99.0 


3 


0.0 




803488 




9.0 


-21.056 


-22.238 


10.483 





-0.818 





0.0 






803886 


9.0 


-21.468 


-22.497 


10.979 





0.997 





0.1 






804237 


3.3 


-20.700 


-21.656 


10.454 





1.536 





0.4 




804862 




2.1 


-18.115 


-18.743 


8.7610 





-0.296 





0.0 






804431 


0.3 


-19.425 


-20.458 


10.325 





-0.15 





0.0 




805059 




0.3 


-17.459 


-18.166 


8.6090 





-0.622 





0.0 



